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(57) Abstract: The present invention provides novel cell wall anchor pro- 
teins derived from yeast, genes thereof and the genetic method for teth- 
ering polypeptide to the yeast cell wall using the same. Particularly, the 
present invention provides the novel GPl(glycosyl phosphatidyl inositol)- 
anchor protein genes, SEDl, GASl, TIP I and CWPI, and their proteins. 
PIR2 cell wall protein gene and its protein derived from Hansenula pofy- 
morpha, and the cell surface expression system using them which im- 
mobilize foreign enzymes or polypeptides on the cell wall of a microbial 
cell. In addition, the present invention provides the cell surface expression 
system using WSCl gene and it s protein derived from yeasts, including 
Hansenula polymorpha and Saccharomyces cerevisiae, and 57^7 gene 
and its protein derived from Saccharomyces diastaticus. The cell surface 
expression system of the present invention expects an immobilization ef- 
fect as biocatalysts by adhereing a desired protein to the cell surface, and 
provides a means of altering target protein characteristics such as binding 
affinity and stability by library screening. 
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INTEllNATIONAI. FORM 

RECEIPT IN THE CASE OF AN ORIGINAL DEPOSIT 

issued pursuuni to Rule 7.1 

IX) ■■ CIIOl. Eui-Sung 

Dasol Apt. 102-507, #395-3. Kung-dong. Yusong-ku, Taejon 30!i-;«J5. 
l^epublic of Korea/ 



I . IDENTIFICATION OI- THE MICROORGANISM 



Identification reference given by the 
niiFOSITOR: 



Escherichia coli 
DII5@/IlpPn{2 



Accession number given by the 
INTERNATIONAL Ui-IOSITARY 
AUTHORITY: 

KCTC 0827BP 



n . SCIEWl-lFIC DESCRIPTION AND/OR PROTOSED TAXONOMIC DESIGNATION 



The microorganism identified under I above was accompanied by: 

[ X ] a scientific description 

[ J a proposed taxonomic designation 

IMarit with a cross where applicable) 



m. RECEIPT AND ACCEPTANCE 



This International Depositary Authority accepts the microorganism identiliL-d undei 1 ahtive. 
which was received by it on July 11 2000. 



IV. RUCEIPT OF REQUEST FOR CONVERSION 



The microorganism identified under I above was received by this IntematiimaJ Depositary 
Authority on and a request to convert the original dt'|X«it to a deposit 

under the Budapest Treaty was received by it on 



V. INTERNATIONAL DEPOSITARY AUTHORITY 



Name: Korean Collection for Type Cultures 



Korea Research Institute of 
liioscience and Biotechnology 
(KRIBB) 

SS2. Oun-dong. Yusong-ku, 
Taejon 305-333. 
Republic of Korea 



Signature(s) of person(s) having the ix)vvi;r 
to represent the International Deiwsitary 
Authority of authorized officiaKs): | 



his 



BAE, Kyung Sook, IJirector 
Date: July 15 2000 
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IN'lERNA'nONAL FOllM 

RECEIPT IN THE CASE OF AN ORIGINAL DEPOSIT 

issued pursuant to Itule 7.1 

IX) : CHOI. Eui-Suns 

Uasol Apt. i 02-507. #;i95-3, Kung-dong, Yusong-ku, Taejon :^{)5-:^35. 
Republic of Korea/ 



1 . IDENTIFICATION OF THE MICROORGANISM 



Identificiition reference given bv the 
DEPOSITOU: 



Escherichia coli 
I)H5@/HpGASI 



Accession number given by the 
INTERNATIONAL DKTOSITAKY 
AUTHORITY; 

KCTC 0828BP 



D. SCIENTIFIC DESCRIFriON AND/OR PROPOSED TAXONOMIC Dl-SIGNATJON 



The microorganism identified under I above was accompanied by: 

[ X ] a scientific description 

[ ] a proposed taxonomic designation 

(Mark with a cross where applicable) 



:. RECEIPT AND ACCEPTANCE 



This International Depositary Authority accepts the microorganism identified under I abovf. 
which was received by it on July 11 2000. 



IV. RKClvIFf OF REQUEST TOR CONVERSION 



The microorganism identified under 1 above was received by this International Dei»>sitary 
Authority on and a request to convert the original deposit to a depo.su 

under the Budapest Treaty was received by it on 



V. INTERNATIONAL DEPOSITARY AUTHORITY 



Name: Korean Collection for Type Cultures 



Address: Korea Research Institute of 
Bioscience and Biotechnology 
(KRIBB) 

#52, Oun-dong, Yusong-ku, 
Taejon 305-333. 
Republic of Korea 



Signature(s) of person(s) having the powei 
to represent the International Depositary 
Authority of authorized officiaKs): 



BAE, Kyung Book, Director 
Date: July 15 2000 
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'What: xs Claimed is 

1. A novel cell wall protein gene HpSEDl and its 
hoinologues represented by the SEQ. ID. NO: 1, derived 

5 from Hansenula polymorpha. 

2. An E. coll transformant (Accession No. KCTC 

0825BP) transformed with a recombinant vector 
containing 4 kb EcoBl fragment of HpSEDl gene of claim 
10 1. 

3. A novel cell wall protein gene HpPIR2 and its 
homologues represented by the SEQ. ID. NO: 4, derived 
from Hansenula polymorpha. 

15 

4. An E. coll transformant (Accession No. KCTC 
0827BP) transformed with a recombinant vector 
containing 5.5 kb Sail fragment of HpPIR2 gene of 
claim 3. 

20 

5. A novel cell wall protein gene HpGASl and its 
homologues represented by the SEQ. ID. NO: 5, derived 
from Hansenula polymorpha . 

25 6. An E. coll transformant (Accession No. KCTC 

0828BP) transformed with a recombinant vector 
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containing 3 kb PstI fragment of HpGASl gene of claim 
5. 

7. A novel cell wall protein gene HpTIPl and its 
5 homologues represented by the SEQ. ID. NO: 6, derived 

from Hansenula polymorpha . 

8. An E. coli transformant (Accession No. KCTC 
0824BP) transformed with a recombinant vector 

10 containing 3.5 kb Xbal fragment of HpTIPl gene of 
claim 7 . 

9. A novel cell wall protein gene HpCWPl and its 
homologues represented by the SEQ. ID. NO: 7, derived 

15' from Hansenula polymorpha. 

10. An E. coli transformant (Accession No. KCTC 
0826BP) transformed with a recombinant vector 
containing 6 kb Sail fragment of HpCWPl gene of claim 

20 9. 

11. A surface expression system for expressing 
foreign polypeptides or proteins on the surface of a 
cell, wherein a polypeptide or its partial fragment, 

25 encoded by the DNA sequence of claim 1, 3, 5, 7 and 9, 
its homologues or its partial fragments, is used as an 

75 
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expression mediator which localizes the foreign 
polypeptides or proteins onto the cell surface. 

12. A surface expression system for expressing 
5 foreign polypeptides or proteins on the surface of a 

cell, wherein a polypeptide or its partial fragment, 
encoded by an isolated DNA derived from Saccharomyces 
diastaticus, containing the base sequence STAl 
represented by the SEQ. ID. NO: 9 or its partial 
10 fragment, is used as an expression mediator which 
localizes the foreign polypeptides or proteins onto 
the cell surface. 

13. A surface expression system for expressing 
15 foreign polypeptides or proteins on the surface of a 

cell, wherein a polypeptide or its partial fragment 
encoded by HpWSCl derived from H. polymorpha, 
containing the base sequence represented by the SEQ. 
ID. NO: 8, or a polypeptide or its partial fragment 
20 encoded by the base sequence WSCl derived from S. 
cerevisiae, is used as an expression mediator which 
localizes the foreign polypeptides or proteins onto 
the cell surface. 

25 14. The surface expression system as set forth in 

any of claims 11 to 13 , wherein the cell is a species 
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selected from the group consisting of yeast spp,, 
including Candida spp., Debaryomyces spp., Hansenula 
spp., Kluyveromyces . spp., Pichia spp., 

Schizosaccharomyces spp., Yarrowia spp., Saccharomyces 
5 spp.; and moid spp., including Aspergillus spp., 
Penicillium spp., and Rhizopus spp. 
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AMENDED CLAIMS 
[received by the International Bureau on 26 November 2001 (26.1 1.01); 
original claims 1-14 replaced by amended claims 1-19 (4 pages)] 

1 . A novel cell wall protein gene HpSEDl and its 
homologues represented by . the SEQ. ID. NO: 1, derived 

5 from Hansenula polymorpha. 

2. (added) A novel cell wall protein HpSedlp and 
its homologues represented by the SEQ. ID. NO: 11, 
encoded by the HpSEDl gene of claim 1 . 

10 

3. An E. coli transformant (Accession No. KCTC 
0825BP) transformed with a recombinant vector 
containing 4 kb EcoRl fragment of HpSEDl gene of claim 
1. 

15 

4. A novel cell wall protein gene HpPIR2 and its 
homologues represented by the SEQ. ID. NO: 4, derived 
from Hansenula polymorpha. 

20 5. (added) A novel cell wall protein HpPir2p and 

its homologues represented by the SEQ. ID. NO: 12, 
encoded by the HpPIR2 gene of claim 4 . 

6. An E. coli transformant (Accession No. KCTC 
25 0827BP) transformed with a recombinant vector 
containing 5.5 kb Sail fragment of HpPIR2 gene of 
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7. A novel cell wall protein gene HpGASl and its 
homologues represented by the SEQ. ID. NO: 5, derived 

5 from Hansenula polymorpha. 

8. (Added) A novel cell wall protein HpGaslp and 
its homologues represented by the SEQ. ID. NO: 14, 
encoded by the HpGASl gene of claim 7 . 

10 

9. An E. coli transformant (Accession No. KCTC 
0828BP) transformed with a recombinant vector 
containing 3 kb Pstl fragment of HpGASl gene of claim 
7. 

15 

10. A novel cell wall protein gene HpTIPl and its 
homologues represented by the SEQ. ID. NO: 6, derived 
from Hansenula polymorpha. 

20 11. (added) A novel cell wall protein HpTiplp and 

its homologues represented by the SEQ. ID. NO: 13, 
encoded by the HpTIPl gene of claim 10. 

12. An E. coli transformant (Accession No. KCTC 
25 0824BP) transformed with a recombinant vector 
containing 3.5 kb Xbal fragment of HpTIPl gene of 
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13. A novel cell wall protein gene HpCWPl and its 
homologues represented by the SEQ. ID. NO: 1, derived 

5 from Hansenula polymorpha. 

14. (added) A novel cell wall protein HpCwplp and 
its homologues represented by the SEQ. ID. NO: 15, 
encoded by the HpCWPl gene of claim 13. 

10 

15. An E. coli transformant (Accession No. KCTC 
082 6BP) transformed with a recombinant vector 
containing 6 kb Sail fragment of HpCWPl gene of claim 
13. 

15 

16. A surface expression system for expressing 
foreign polypeptides or proteins on the surface of a 
cell, wherein a polypeptide or its partial fragment, 
encoded by the DNA sequence of claim 1, 4, 7, 10 and 

20 13, its homologues or its partial fragments, is used 
as an expression mediator which localizes the foreign 
polypeptides or proteins onto the cell surface. 

17. A surface expression system for expressing 
25 foreign polypeptides or proteins on the surface of a 

cell, wherein a polypeptide or its partial fragment, 
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encoded by an isolated DNA derived from Saccharomyces 
diastatxcus, containing the base sequence STAl 
represented by the SEQ. ID. NO: 9 or its partial 
fragment, is used as an expression mediator which 
5 localizes the foreign polypeptides or proteins onto 
the cell surface. 

18. A surface expression system for expressing 
foreign polypeptides or proteins on the surface of a 

10 cell, wherein a polypeptide or its partial fragment 
encoded by HpWSCl derived from H. polymorpha, 
containing the base sequence represented by the SEQ. 
ID. NO: 8, or a polypeptide or its partial fragment 
encoded by the base sequence WSCl derived from S. 

15 cerevislae, is used as an expression mediator which 
localizes the foreign polypeptides or proteins onto 
the cell surface. 

19. The surface expression system as set forth in 
20 any of claims 16 to 18, wherein the cell is a species 

selected from the group consisting of yeast spp., 
including Candida spp., Debaryomyces spp., Hansenula 
spp., Kluyveromyces spp., Pichia spp., 

Schlzosaccharomyces spp., Yarrowia spp., Saccharomyces 
25 spp.; and mold spp., including Aspergillus spp., 
Penicillium spp., and Rhizopus spp. 
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FIG. 3 
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FIG. 4 
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FIG. 5 
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FIG. 6 
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FIG. 7 
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FIG. 8 
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FIG. 9 
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FIG. 10a 
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FIG. 12 
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FIG. 13 
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FIG. 15 
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SEQUENCE LISTING 

<1 10> Korea Research Institute of Bioscience and Biotechnology 

<120> Novel cell wall anchor proteins derived from yeast, genes thereof 
and cell surface expression systems using the same 

<130> OFPO-07-20 

<160> 15 

<170> Kopatentin 1.55 

<210> 1 

<211> 396 

<212> DNA 

<213> Hansenula polymorpha 



<400> 1 

atgcaattca gaactttggc tccacttgct ttggcttccg ctgctttcgc tgcttactct 60 

aacggcaccg tctccaccat tacttacgag accaccgttt ctgaggtcgt tactgctttg 120 

actacctact gcccagaagc cacctctatc gtcaccaacg gaaagaccta cactgtcact 180 

ggtgccacca ccttgaccat caccgactgc ccatgcacta agaagaaggt catcaccacc 240 

accactgtca ccaccatccc agctaaatct tccactgctg cttcctctgt tgctgcttcc 300 

tctgctccag tcatctccac tgccgagaac gctggtgcta aggttggtgc tgctggtttg 360 

gctgcccttg ccggtgctgc tgctttcttg ctctaa 396 

1 
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<210> 2 

<211> 9 

<212> PRT 

<213> Hansenula polymorpha 

<400> 2 

Ser Gin He Gly Asp Gly Gin He Gin 
1 5 



<210> 3 

<211> 13 

<212> PRT 

<213> Hansenula polymorpha 

<400> 3 

Gly Val Val Xaa Gin He Gly Asp Gly Gin He Gin Ala 
1 5 10 



<210> 4 
<211> 1014 
<212> DNA 

<213> Hansenula polymorpha 

<400> 4 ... 
atgaagttca catcctcgct cgctgccatc agtttggcct ccaacgcatt cgctgcctat 60 

gttggctcta cctactggac taccatgacc ccatcctaca ccctggatgg tgctctgact 120 
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agctactcgg ctactttcgg tattgctgtt gaaccactag agaccagttc ctcggtttcc 180 

gcctctctca acgttgagaa aagacaggtt gtttctcaaa ttggtgatgg tcaaattcag 240 

gctaccacga acaccgagaa agaaacctcc aaatcctcta cttctactgc cgcagctgtt 300 

gtgtcgcaaa tcacggacgg tcaaatccaa gccaccactg ccaccaccac ctcttcttca 360 

tcgagctcca agaagactgc cgcagctgtt gtcactcaaa ttggsgacgg tcaaatccaa 420 

gctaccacct ccacttcttc caagagcact gctgctgacg ttgttaccca aatcggcgat 480 

ggtcagatcc aagccaccag caagtcgtca tccacttcca ctgctgctga cgttgtgtct 540 

cagatcactg acggccagat ccaagctacc accagcacca aggcctcttc tgccaccacc 600 

agcggtgtga tctcccagat ctccgacggt caaatccagg catcttccac cgcttcttcg 660 

aagacctcca ccgcttcctc atccactgca actggagact acgtcacctc tgtgtcctgt 720 

aagaaggagg gtgctctggc catgactttg aaggacggta tcctgtatga ctcggaggga 780 

agaattggct ctatcgttgc taacagacaa ttccaattcg acggtcctcc accacaagct 840 

ggtgccatct atgctgacgg atggtccatt tccccagacg gatacctggc cattggtaac 900 

gacaccatat tctaccagtg tctgtcgggc accttctaca acttgtacga ccagtcgatt 960 

ggaggccaat gtaataaggt ccacttgaag gctgtcgagt tggtcgactg ttag 1014 
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<210> 5 

<211> 1614 

<212> DNA 

<213> Hansenula polymorpha 



<400> 5 

atgcagctaa aatctatcct ctcgctcacg ggcctgctct ccaccacgct ggctctgcct 60 

accattgacg tggtgtccaa caagttcttc tattcgaaca acgggtccca attctacgtc 120 

aaaggtgtcg cgtaccagaa gaacacagaa aatgctaccg acgacgcaac ttatgtcgat 180 

ccgctcgctg acgaagattc gtgcaagaga gacatcccgt acttacagaa ccttggcatc 240 

aatgttatcc gggtgtatgc agtcgatgcc tccaaggacc acgacggatg catgtctctg 300 

ctcgaggatg ctggaatcta tgtgatctcc gatctttcga ccccaaacga gtcgatcgag 360 

accaccagtc cgtcctggac tgtcgatctg tacaacagat atgccacggt gatcgatatg 420 

ttccaaagct acgacaacgt gctcggcttt tttgcaggta atgaggttat caccaacaag 480 

accaacagtg acgctgctcc gttcgtcaag gccgctatca gagacatgaa gcagtacatg 540 

aaggacaaca actacagaga cattccgatc ggctactcgg ccaacgacga tgccaacacc 600 

agagttccgt. ctgcggacta cttctcctgt ggaaacgacg acgtcaaggc agacttttac 660 

ggtatcaaca tgtacgagtg gtgtggaaat gccacgttct caagctccgg ctacgaggcg 720 
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agaacgaagg aattttccaa tttgacgatc ccgatctttt tctcagagta cggttgcaac 780 

agcgtcaagc cacgtgagtt cacagaggtg caagctatct actccgatga aatgacagac 840 

gtttggtccg gcggtatcgt gtacatgtac ttccaggaag agaacgacta cggcctcgtt 900 

tccatcaaag acaatgctgt ctcgactttg ggcgactaca ctaacctcaa gagtgagctc 960 

gcgaaaatta gccctaccac ggcgtctgcc tctgctgcat cgcagtctgc cacagaattg 1020 

agttgtccaa ccagccagag caactggaag gcatccacag accttcctcc aactccaaat 1080 

gaggccgtgt gcgactgtct tgagtcgtcg ctcaaatgtg tcgtctccga cagcgtcgac 1140 

tccgacgact acggcgactt cttcggtatc gtctgcgatc tcaccgactg ttctcaaatt 1200 

tccaccagcg gcagtaacgg ctcctacggc gcatactcgt actgctcagc caaggacaag 1260 

ctttcgttcc tgctcaacaa atactacgag gaacaggact ccaactcgtc ggcttgcgac 1320 

ttcagcggct ctgcctcgct caataccaac ggctcgacag catccagctg ctcttcctta 1380 

ttgagctctg cctcggcctc tccatcggcc actggctcct caaactcctc ccctgcatct 1440 

ggctccggct ctagttccgg ctccagctcc ggctctggct ccagcagctc cagctcctcg 1500 

tcgtcgtctg ccggcgcggg tgtcaacgct gtcccattgt cggccccaca attgggcctg 1560 

ctctccttgt tctccacctt cttcttgggc ggactctcct acatctttat ctag 1614 
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<210> 6 

<211> . 852 
<212> DNA 

<213> Hansenula polymorpha 



<400> 6 

atgctctcgt ttaaaactct ttgcgctaca gctttcgctc ttactgccgt ttctgctgcc 60 

cctcaggctg acccatttgc ttttgctaat gcaatcggac ttccagttgc agcagaggca 120 

acttacgctt gtcatgcctc ttgtggttac gctattttag ctgctcgtca atgttcccca 180 

actggctccg aggacgctaa ctacaactcc acctgtctgt gtgcttccga cagtcaattc 240 

ttgtcctacg tcccagcctg ccttgactgt ggatggtgtc tgtggagtga ctacggatcc 300 

ttcttgacct ctgctttggc tgaatgtcac accaacaccc agccaactgg tactacttgt 360 

gccccaagca ctgctcaagc cgctgctacc tcttctgttg ctgctgcagc cagtgaagtc 420 

tcttcttcgt cggctgctgc ttcttcaact caggctgccg ctgctgcttc cacttctgct 480 

gctgcctcga ctgaggccac tacttccgct gctgctgctg ctacttcttc ctctgaggct 540 

gctagctcat ctgcccatgt tcatagccat gctgctgagt ccacctccgc tgttgagtcc 600 

acttccgctg ctcactccca tgctgctgag tcgagctccg ctgctcattc ccatgctgtc 660 

gaatccagct ccgctgctca tgttcattcc catgccgccg aatccagctc cgctgctcac 720 

agccacgctg ctggatctag ctccgctgca tccaactcta gtggacatat ctccactttc 780 
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tcgggagctg gtgctaaact cgccgttgga gccggagccg gtattgtcgg tcttgcagct 840 
ctgcttatgt aa 852 



<210> 7 

<211> 246 

<212> DNA 

<'213> Hansenula polymorpha 



<400> 7 

atgtcctggc ttacactatt ggtgacgcca gcggttctgc tgcctttcta ctctgctctg 60 

tacgagaaaa ctgctgcagc tcagtcttct cagtactcta gcagccctgc agctgtctct 120 

tcaagaagct ctgcagctgc ctcttctagt gccaaaattg ctacttacga gggtgctgct 180 

gccgagaatg ttgctaaggt cggcatgggt gctttgctgt cgggaatggc cgttcttctt 240 

atgtaa 246 



<210> 8 

<211> 1119 

<212> DNA 

<213> Hansenula polymorpha 
<220> 

<221> inisc_feature 

<222> (43) . . C294) 
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<223> cystein motif 
<220> 

<221> misc_feature 

<222> (760).. (873) 

<223> transmembrane domain 



<400> 8 

atgagattcg gtgtcgtatt cgtgatcacc caagtcctag cagactttac gtatttaggt 60 

tgctacagct cggatgccat ttccggcttg accaagaagg actcgtacac atggcagagc 120 

tccagccatt gcacggagca gtgttcggga cacgcagttg ctgcattgat caacggccag 180 

gactgttatt gcggagacga cgtaccttcg gacaaccccg acggctcgtg cacaacgtcc 240 

tgcactggtt atcccatgga gaagtgtggt ggtagcgact cttattccgt ctatgttgac 300 

gagtctgagg aaaacgacga cgacagttcg tcggcgcagt cgtcgcactc gtccacagac 360 

gatgccactt ccacctcctc cacctccacc acctcctcct ccagttccag tctgtccagc 420 

tcctcaacat cgtcttcctc caagcaaagt tcctccccac agagctccac catgtcttcc 480 

acagactcgt ctccaacgtc ctcaagtctg tccgctagct cgactacaac aagctcgatt 540 

tcgtcctttt ctttctccca gagctcgtcg tcgtcttcaa ccacctcttc gtccacaccc 600 

tcatccgaat cagtgcggat aactacgtcc gtttcacctg gaaacatgca gacctcgatc 660 

atctacatca cgcagtccgt cgctacagcc acctcggcgt cggctgccgc gtcgtcctca 720 
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agtgcctcga gcgccaacaa caggtccacg gggctcagca agggcgcgaa ggccggaatc 780 

gctgtcgggt cgatcctcgg agctttgctg ctcttgggac tcctgctcct gttcctgttt 840 

tggcgcagac gccagcgcga cgacagagac aaccttagcg aaaagcgcgc atccagcatt 900 

ttggcgtcgt cttcccgtca gcctccagct ggctcgcgag gtgcggcagc aggaatcgga 960 

gccaaccgca ttcggattca tgtccgagga cgacagactg gacatgccgg gcacgtcgag 1020 

acggttcagc gacggttcgt tgccgggacg ctgccgctgg agccgcggtg ccgccgaata 1080 

gtgcccgaca ggaggtttgc gggtggtaaa cccagatct 1119 



<210> 9 

<211> 2337 

<212> DNA 

<213> Saccharomyces diastaticus 
<220> 

<221> misc_feature 

<222> (1)..(96) 

<223> secretion signal sequence 

<220> . 

<22 1> mis c_f eat ur e 

<222> (97).. (120) 

<223> octapeptide 
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<220> 

<221> misc_feature 

<222> (121).. (882) 

<223> serine/ threonine rich domain 



<400> 9 

atggtaggcc tcaaaaatcc atatacgcac actatgcaaa gaccatttct actcgcttat 60 

ttggtccttt cgcttctatt taactcagct ttgggttttc caactgcact agttcctaga 120 

ggatcctcct ctagcaacat cacttcctcc ggtccatctt caactccatt cagctctgct 180 

actgaaagct tttctactgg cactactgtc actccatcat catccaaata ccctggcagt 240 

aaaacagaaa cttctgtttc ttctacaacc gaaactacca ttgttccaac tacaactacg 300 

acttctgtca taacaccatc aacaaccact attaccacta cggtttgctc tacaggaaca 360 

aactctgccg gtgaaactac ttctggatgc tctccaaaga ccattacaac tactgttcca 420 

tgttcaacca gtccaagcga aaccgcatcg gaatcaacaa ccacttcacc taccacacct 480 

gtaactacag ttgtctcaac caccgtcgtt actactgagt attctactag tacaaaacaa 540 

ggtggtgaaa ttacaactac atttgtcacc aaaaacattc caaccactta cctaactaca 600 

attgctccaa cttcatcagt cactacggtt accaatttca ccccaaccac tattactact .660 

acggtttgct ctacaggaac aaactctgcc ggtgaaacta cctctggatg ctctccaaag 720 
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actgtcacaa caactgttcc ttgttcaact ggtactggcg aatacactac tgaagctacc 780 

gcccctgtta caacagctgt cacaaccacc gttgttacca ctgaatcctc tacgggtact 840 

aactccgctg gtaagacgac aactagttac acaacaaagt ctgtaccaac cacctatgta 900 

tttgactttg gcaagggcat tctcgatcaa agctgcggcg gtgtattttc aaacaacggc 960 

tcttcgcaag tgcagctgcg ggatgtagtc ttgatgaatg ggacagtggt atacgattca 1020 

aacggcgctt gggacagtag tgcgctggag gagtggctcc agcgacagaa aaaagtttcc 1080 

atcgaaagaa tatttgaaaa tattgggccc agcgccgtgt atccgtctat tttgcctggg 1140 

gtcgtgattg cgtcaccatc gcaaacgcat ccagactact tctaccaatg gataagggac 1200 

agcgcgttga cgataaacag tattgtctct cattctgcgg acccggcaat agagacgtta 1260 

ttgcagtacc tgaacgtttc attccacttg caaagaacca acaacacatt gggcgctggc 1320 

attggttaca ctaacgatac agtggctttg ggagacccta agtggaacgt cgacaacacg 1380 

gctttcacgg aaccttgggg tcgtcctcaa aacgatggcc ctgctcttcg aagcattgcc 1440 

atcttaaaaa tcatcgacta catcaagcaa tctggcactg atctgggggc caagtaccca 1500 

ttccagtcca ccgcagatat ctttgatgat attgtacgtt gggacctgag gttcattatt 1560 

gaccactgga attcttccgg atttgatcta tgggaggaag tcaatggcat gcatttcttt 1620 

actttactgg tacaactgtc tgcagtggac aggtcgctgt cgtattttaa cgcctcagaa 1680 
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cggtcgtctc cctttgttga agaattgcgt cagacacgcc gggacatctc caagttttta 1740 

gtggaccctg cgaatgggtt tatcaacggc aagtacaatt atattgttga gacacccatg 1800 

attgccgaca cattgagatc cggactggac atatccactt tattagctgc gaacaccgtc 1860 

cacgatgcgc catctgcttc ccat'cttccg ttcgatatca atgaccctgc cgtcctgaac 1920 

acgttgcacc atttgatgtt gcacatgcgt tcgatatacc ccatcaacga tagctccaaa 1980 

aatgcaacgg gtattgccct gggccggtat cctgaggacg tatatgatgg atatggcgtt 2040 

ggcgagggaa atccctgggt cctggccacg tgtgccgctt caacaacgct ttatcagctc 2100 

atttacagac acatctctga gcagcatgac ttggttgtcc caatgaacaa cgattgttcg 2160 

aacgcatttt ggagcgagct ggtattctcc aacctcacga ctttgggaaa tgacgaaggc 2220 

tatttgattt tggagttcaa tacacctgcc ttcaatcaaa ccatacaaaa aatcttccaa 2280 

ctagctgatt cattcttggt caagctgaaa gccacgtggg aacagacggg gaactaa 2337 



<210> 10 

<211> 8 

<212> PRT 

<213> Saccharomyces diastaticus 



<400> 10 

Phe Pro Thr Ala Leu Val Pro Arg 
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1 5 



<210> 11 
<211> 131 
<212> PRT 

<213> Hasenula poiymorpha 
<400> 11 

Met Gin Phe Arg Thr Leu Ala Pro Leu Ala Leu Ala Ser Ala Ala Phe 
15 10 15 

Ala Ala Tyr Ser Asn Gly Thr Val Ser Thr He Thr Tyr Glu Thr Thr 
20 25 30 

Val Ser Glu Val Val Thr Ala Leu Thr Thr Tyr Cys Pro Glu Ala Thr 
35 40 45 

Ser He Val Thr Asn Gly Lys Thr Tyr Thr Val Thr Gly Ala Thr Thr 
50 55 60 

Leu Thr He Thr Asp Cys Pro Cys Thr Lys Lys Lys Val He Thr Thr 
65 70 75 80 

Thr Thr Val Thr Thr He Pro Ala Lys Ser Ser Thr Ala Ala Ser Ser 
85 90 95 

Val Ala Ala Ser Ser Ala Pro Val He Ser Thr Ala. Glu Asn Ala Gly 
100 105 110 

Ala Lys Val Gly Ala Ala Gly Leu Ala Ala Leu Ala Gly Ala Ala Ala 
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120 



Phe Leu Leu 
130 



<210> 12 
<211> 337 
<212> PRT 

<213> Hansenula polymorpha 
<400> 12 

Met Lys Phe Thr Ser Ser Leu Ala Ala He Ser Leu Ala Ser Asn Ala 
15 10 15 

Phe Ala Ala Tyr Val Gly Ser Thr Tyr Trp Thr Thr Met Thr Pro Ser 
20 25 30 

Tyr Thr Leu Asp Gly Ala Leu Thr Ser Tyr Ser Ala Thr Phe Gly He 
35 40 45 

Ala Val Glu Pro Leu Glu Thr Ser Ser Ser Val Ser Ala Ser Leu Asn 
50 55 60 

Val Glu Lys Arg Gin Val Val Ser Gin He Gly Asp Gly Gin He Gin 
65 70 75 80 

Ala Thr Thr Asn Thr Glu Lys Glu Thr Ser Lys Ser Ser Thr Ser Thr 
85 90 95 

Ala Ala Ala Val Val Ser Gin He Thr Asp Gly Gin He Gin Ala Thr 
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100 105 110 

Thr Ala Thr Thr Thr Ser Ser Ser Ser Ser Ser Lys Lys Thr Ala Ala 
115 120 125 

Ala Val Val Thr Gin He Gly Asp Gly Gin He Gin Ala Thr Thr Ser 
130 135 140 

Thr Ser Ser Lys Ser Thr Ala Ala Asp Val Val Thr Gin He Gly Asp 
145 150 155 160 

Gly Gin He Gin Ala Thr Ser Lys Ser Ser Ser Thr Ser Thr Ala Ala 
165 170 175 

Asp Val Val Ser Gin He Thr Asp Gly Gin He Gin Ala Thr Thr Ser 
180 185 190 

Thr Lys Ala Ser Ser Ala Thr Thr Ser Gly Val He Ser Gin He Ser 
195 200 205 

Asp Gly Gin He Gin Ala Ser Ser Thr Ala Ser Ser Lys Thr Ser Thr 
210 215 220 

Ala Ser Ser Ser Thr Ala Thr Gly Asp Tyr Val Thr Ser Val Ser Cys 
225 230 235 240 

Lys Lys Glu Gly Ala Leu Ala Met Thr Leu Lys Asp Gly He Leu Tyr 
245 . 250 , 255 

Asp Ser Glu Gly Arg He Gly Ser He Val Ala Asn Arg Gin Phe Gin 
260 265 270 



15 



wo 02/12509 



PCT/KROO/00819 



Phe Asp Gly Pro Pro Pro Gin Ala Gly Ala He Tyr Ala Asp Gly Trp 
275 280 285 

Ser He Ser Pro Asp Gly Tyr Leu Ala He Gly Asn Asp Thr He Phe 
290 295 300 

Tyr Gin Cys Leu Ser Gly Thr Phe Tyr Asn Leu Tyr Asp Gin Ser He 
305 310 315 320 

Gly Gly Gin Cys Asn Lys Val His Leu Lys Ala Val Glu Leu Val Asp 
325 330 335 

Cys 



<210> 13 
<211> 283 
<212> PRT 

<213> Hansenula polymorpha 
<400> 13 

Met Leu Ser Phe Lys Thr Leu Cys Ala Thr Ala Phe Ala Leu Thr Ala 
15 10 15 

Val Ser Ala Ala Pro Gin Ala Asp Pro Phe Ala Phe Ala Asn Ala He 
.20 . 25 30 

Gly Leu Pro Val Ala Ala Glu Ala Thr Tyr Ala Cys His Ala Ser Cys 
35 40 45 
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Gly Tyr Ala He Leu Ala Ala Arg Gin Cys Ser Pro Thr Gly Ser Glu 
50 55 60 

Asp Ala Asn Tyr Asn Ser Thr Cys Leu Cys Ala Ser Asp Ser Gin Phe 
65 70 75 80 

Leu Ser Tyr Val Pro Ala Cys Leu Asp Cys Gly Trp Cys Leu Trp Ser 
85 90 95 

Asp Tyr Gly Ser Phe Leu Thr Ser Ala Leu Ala Glu Cys His Thr Asn 
100 105 110 

Thr Gin Pro Thr Gly Thr Thr Cys Ala Pro Ser Thr Ala Gin Ala Ala 
115 120 125 

Ala Thr Ser Ser Val Ala Ala Ala Ala Ser Glu Val Ser Ser Ser Ser 
130 135 140 

Ala Ala Ala Ser Ser Thr Gin Ala Ala Ala Ala Ala Ser Thr Ser Ala 
145 150 155 160 

Ala Ala Ser Thr Glu Ala Thr Thr Ser Ala Ala Ala Ala Ala Thr Ser 
165 170 175 

Ser Ser Glu Ala Ala Ser Ser Ser Ala His Val His Ser His Ala Ala 
180 185 190 

Glu Ser Thr Ser Ala Val Glu Ser Thr Ser Ala Ala His Ser His Ala 
195 200 205 
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Ala Glu Ser Ser Ser Ala Ala His Ser His Ala Val Glu Ser Ser Ser 
210 215 220 

Ala Ala His Val His Ser His Ala Ala Glu Ser Ser Ser Ala Ala His 
225 230 235 240 

Ser His Ala Ala Gly Ser Ser Ser Ala Ala Ser Asn Ser Ser Gly His 
245 250 255 

He Ser Thr Phe Ser Gly Ala Gly Ala Lys Leu Ala Val Gly Ala Gly 
260 265 270 

Ala Gly He Val Gly Leu Ala Ala Leu Leu Met 
275 280 



<210> 14 
<211> 537 
<212> PRT 

<213> Hansenula polymorpha 
<400> 14 

Met Gin Leu Lys Ser He Leu Ser Leu Thr Gly Leu Leu Ser Thr Thr 
15 10 15 

Leu Ala Leu Pro Thr He Asp Val Val Ser Asn Lys Phe Phe Tyr Ser 
20 25 30 

Asn Asn Gly Ser Gin Phe Tyr Val Lys Gly Val Ala Tyr Gin Lys Asn 
35 40 45 
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Thr Glu Asn Ala Thr Asp Asp Ala Thr Tyr Val Asp Pro Leu Ala Asp 
50 55 60 

Glu Asp Ser Cys Lys Arg Asp He Pro Tyr Leu Gin Asn Leu Gly He 
65 70 75 80 

Asn Val He Arg Val Tyr Ala Val Asp Ala Ser Lys Asp His Asp Gly 
85 90 95 

Cys Met Ser Leu Leu Glu Asp Ala Gly He Tyr Val He Ser Asp Leu 
100 105 110 

Ser Thr Pro Asn Glu Ser He Glu Thr Thr Ser Pro Ser Trp Thr Val 
115 120 125 

Asp Leu Tyr Asn Arg Tyr Ala Thr Val He Asp Met Phe Gin Ser Tyr 
130 135 140 

Asp Asn Val Leu Gly Phe Phe Ala Gly Asn Glu Val He Thr Asn Lys 
145 150 155 160 

Thr Asn Ser Asp Ala Ala Pro Phe Val Lys Ala Ala He Arg Asp Met 
165 170 175 

Lys Gin Tyr Met Lys Asp Asn Asn Tyr Arg Asp He Pro He Gly Tyr 
180 185 190 

Ser Ala Asn Asp Asp Ala Asn Thr Arg Val Pro Ser Ala Asp Tyr Phe 
195 200 205 

Ser Cys Gly Asn Asp Asp Va! Lys Ala Asp Phe Tyr Gly He Asn Met 
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210 215 220 

Tyr Glu Trp Cys Gly Asn Ala Thr Phe Ser Ser Ser Gly Tyr Glu Ala 
225 230 235 240 

Arg Thr Lys Glu Phe Ser Asn Leu Thr He Pro He Phe Phe Ser Glu 
245 250 255 

Tyr Gly Cys Asn Ser Val Lys Pro Arg Glu Phe Thr Glu Val Gin Ala 
260 265 270 

He Tyr Ser Asp Glu Met Thr Asp Val Trp Ser Gly Gly He Val Tyr 
275 280 285 

Met Tyr Phe Gin Glu Glu Asn Asp Tyr Gly Leu Val Ser He Lys Asp 
290 295 300 

Asn Ala Val Ser Thr Leu Gly Asp Tyr Thr Asn Leu Lys Ser Glu Leu 
305 310 315 320 

Ala Lys He Ser Pro Thr Thr Ala Ser Ala Ser Ala Ala Ser Gin Ser 
325 330 335 

Ala Thr Glu Leu Ser Cys Pro Thr Ser Gin Ser Asn Trp Lys Ala Ser 
340 345 350 

Thr Asp Leu Pro Pro Thr Pro Asn Glu Ala Val Cys Asp Cys Leu Glu 

355 360 365 

Ser Ser Leu Lys Cys Val Val Ser Asp Ser Val Asp Ser Asp Asp Tyr 
370 375 380 
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Gly Asp Phe Phe Gly He Val Cys Asp Leu Thr Asp Cys Ser Gin He 
385 390 395 400 

Ser Thr Ser Gly Ser Asn Gly Ser Tyr Gly Ala Tyr Ser Tyr Cys Ser 
405 410 415 

Ala Lys Asp Lys Leu Ser Phe Leu Leu Asn Lys Tyr Tyr Glu Glu Gin 
420 425 430 

Asp Ser Asn Ser Ser Ala Cys Asp Phe Ser Gly Ser Ala Ser Leu Asn 
435 440 445 

Thr Asn Gly Ser Thr Ala Ser Ser Cys Ser Ser Leu Leu Ser Ser Ala 
450 455 460 

Ser Ala Ser Pro Ser Ala Thr Gly Ser Ser Asn Ser Ser Pro Ala Ser 
465 470 475 480 

Gly Ser Gly Ser Ser Ser Gly Ser Ser Ser Gly Ser Gly Ser Ser Ser 
485 490 . 495 

Ser Ser Ser Ser Ser Ser Ser Ala Gly Ala Gly Val Asn Ala Val Pro 
500 505 510 

Leu Ser Ala Pro Gin Leu Gly Leu Leu Ser Leu Phe Ser Thr Phe Phe 
515 520 525 

Leu Gly Gly Leu Ser Tyr He Phe He 
530 535 
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<210> 


15 


<211> 


81 


<212> 


PRT 


<213> 


Hansenut 


<400> 


15 



Met Ser Trp Leu Thr Leu Leu Val Thr Pro Ala Val Leu Leu Pro Phe 
15 10 15 

Tyr Ser Ala Leu Tyr Glu Lys Thr Ala Ala Ala Gin Ser Ser Gin Tyr 
20 25 30 

Ser Ser Ser Pro Ala Ala Val Ser Ser Arg Ser Ser Ala Ala Ala Ser 
35 40 45 

Ser Ser Ala Lys He Ala Thr Tyr Glu Gly Ala Ala Ala Glu Asn Val 
50 55 60 

Ala Lys Val Gly Met Gly Ala Leu Leu Ser Gly Met Ala Val Leu Leu 
65 70 75 80 

Met 
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NOVEL CELL WALL ANCHOR PROTEINS DERIVED FROM 
YEAST, GENES THEREOF AND CELL SURFACE 
EXPRESSION SYSTEMS USING THE SAME 



The present invention relates to novel cell wall 
proteins derived from yeast, genes thereof, and 



10 particularly, the present invention relates to surface 
expression systems using four Hansenula polymorpha- 
derived, GPI {glycosylphosphatidylinositol) -anchor 
proteins, HpSedlp, HpGaslp, HpTiplp and HpCwplp, and a 
Hansenula polymorpha-derived cell wall protein, 

15 HpPir2p for the expression of foreign proteins onto 
the cell surface of Hansenula polymorpha and other 
yeasts. Also, the present invention relates to surface 
expression systems using proteins coded by Hansenula 
polymorpha-der ived • HpWSCl and Saccharomyces 

20 diastaticus-derived STAl for the expression of foreign 
proteins on the cell surface of Hansenula polymorpha 
and other yeasts. 



5 



FIELD OF THE INVENTION 



surface expression systems using the same. 



More 



BACKGROUND ART OF THE INVENTION 



25 



In 



recent 



years. 



active research have been 
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directed to expression of desired foreign proteins on 
the surface of single cell organisms, such as 
bacteriophages, bacteria, and yeasts, because proteins 
expressed on the cell surface have various useful 
5 applications, including production of novel vaccines, 
screening of various antigens and antibodies, 
immobilization of useful enzymes onto the cell surface, 
and the like. 

At first, the expression of foreign proteins on 

10 the cell surface was applied to the screening of 
epitopes and antigenicity-determining peptide 
fragments, with the aim of stable vaccine production. 
Before then, the production of vaccine had been 
achieved by the selection of the mutants, which showed 

15 stable and continuous titers from the randomly mutated 
pathogen library. However, the vaccines produced in 
the conventional manner are likely to lose their 
antigenicity when being administered via oral routes 
to humans or animals. Much effort has been made to 

20 overcome these problems using the live oral vaccines 
displayed on the cell surface. 

In one strategy to express antigenic proteins on 
the cell surface, endogenous cell surface proteins are 
used to guide these proteins onto the cell surface. 

25 For example, a gene encoding a cell surface protein is 
fused to a gene encoding an antigenic protein, and the 
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resulting recombinant gene is introduced into Gram- 
negative bacteria to express the fusion protein on the 
cell surface. The antigenic protein passenger in the 
fusion protein vehicle can act as an effective antigen 
5 to elicit immune responses. Especially, Gram-negative 
bacteria are very efficient for this purpose, because 
the lipopolysaccharide (LPS) presenting on their cell 
envelope enhances the antigenicity of the fusion 
protein expressed on the cell surface. 

10 As a rule, proteins to be secreted or expressed 

on cell surfaces have secretion signals, which allow 
the nascent proteins to pass through the cytoplasmic 
membrane, in their primary sequence. To be expressed 
on the cell surface of Gram-negative bacteria, a 

15 protein translocates across the cytoplasmic membrane 
and the periplasmic space and embedded in the outer 
membrane so as to protrude to the outer membrane. In 
bacteria, several enzymes and toxins have secretion 
signals and (or) targeting signals, which induce the 

20 proteins to target regions. Therefore, using such a 
secretion signal or targeting signal with the suitable 
promoter can successfully perform localization of a 
foreign protein on cell surfaces. 

Thus far, extensive attempts have been made to 

25 utilize surface proteins of Gram-negative bacteria in 
expressing foreign proteins of interest onto the cell 
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surface. The surface proteins used for the 
localization of foreign proteins can be largely 
classified into outer membrane proteins, lipoproteins, 
secretory proteins, and cell surface organelle 
5 proteins . 

LamB, PhoE and OmpA, known as outer membrane 
proteins of Gram-negative bacteria, have been used for 
the production of foreign proteins on cell surfaces. 
When the outer .membrane proteins are employed, foreign 

10 proteins are limited in size, because they must be 
inserted into the loops protruded out of the cell 
surface. Additionally, since the C- and N-termini of 
the foreign proteins to be inserted are required to be 
located near each other in the three dimensional 

15 structure, both termini must be brought close to each 
other by the use of a peptide linkage when the 
distance therebetween is large. 

In fact, where LamB or PhoE is used, insertion of 
a foreign polypeptide as large as or larger than 50-60 

20 amino acids failed to construct a stable membrane 
protein by the steric hindrance. (Charbit et al., J. 
Immunol., 1997, 139, 1658-1664; Agterberg et al.. 
Vaccine, 1990, 8, 85-91) . To solve this problem, an 
OmpA fragment containing a minimum target signal 

25 necessary for proper location was tried. For example, 
P-lactamase linked to the C-terminus of a target 
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signal of OmpA was successfully expressed on the cell 
surface. Protein translocation from the cytoplasm to 
the outer membrane was achieved by the fusion of the 
signal sequence for the lipoprotein Lpp of E. coll to 
the N-terminus of OmpA (Francisco et al., Proc. Natl. 
Acad. Sci. USA, 1992, 489, 2713-2717) . 

As described above, use of bacterial outer 
membrane proteins in cell surface display of foreign 
proteins requires linkage between the foreign proteins 
and appropriate outer membrane proteins at a gene 
level, so as to synthesize fusion proteins capable of 
passing through the cytoplasmic membrane and being 
stably embedded in the outer membrane . Suitable 
surface anchoring motif is an outer membrane protein 
which satisfies the following requirements: 1) to have 
a secretion signal which allows the fusion protein to 
pass through the cytoplasmic membrane; 2) to have a 
target signal which allows the fusion protein to 
anchor in the outer membrane; 3) to be expressed on 
the outer membrane in large quantities; and 4) to be 
expressed stably irrespective of protein size. Thus 
far, a surface-anchoring motif, which meets all of the 
requirements, has not yet been developed. 

Meanwhile, lipoproteins have also been used as a 
surface-anchoring motif. Especially, lipoprotein from 
E. coll is very useful, because it translocates across 
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the inner membrane by the N-terminal secretion signals 
and directly linked to outer or inner membrane lipids 
via the covalent bond of its terminal L-cystein. Lpp, 
a major lipoprotein from E. coli, which is associated 
5 with the outer membrane at its N-terminus and with the 
cell wall peptidoglycan (PG) at its C-terminus, can be 
used to secrete and transport foreign proteins onto 
the surface of E. coli by fusion with the outer 
membrane protein A (OmpA, Francisco et al . , Proc. Natl. 

10 Acad. Sci. USA, 1992, 489, 2713-2717). Another 
lipoprotein used in the surface expression of foreign 
protein is TraT. It was reported that TraT has been 
used to express peptides such as polioviral C3 epitope 
on the cell surface of E. coli (Felici et al., J. Mol. 

15 Biol.r 1991, 222, 301-310). Additionally, a 
peptidoglycan-associated lipoprotein (PAL), whose 
function has not been elucidated clearly yet, was used 
for the surface expression of a recombinant antibody 
(Fuchs et al., Bio/Technology, 1991, 9, 1369-1372). In 

20 this case, C-terminus of the PAL was associated with 
peptidoglycan and N-terminus of it was fused to the 
recombinant antibody exposed on the cell surface. 

Secretory proteins, which pass through the outer 
membrane, may be used as the surface anchor, but these 

25 are not well developed in Gram-negative bacteria. Only 
a few proteins have the secretory mechanisms by aid of 
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the helper proteins. For instance, the lipoprotein 
pullulanase secreted from Klebsiella oxytoca is 
anchored on the outer membrane via a linkage between 
the lipid and its N-terminus, and completely secreted 
5 into a culture medium during the growth-resting phase. 
Kornacker et al.' have been tried to express p- 
lactamase on the cell surface using the N-terminal 
fragment of pullulanase, but the expressed 
pullulanase-p-lactamase fusion protein was released to 

10 the cell media after short period of anchoring. When 
using alkaline phosphatase, a periplasmic space 
protein, as a target protein, the surface expression 
was not achieved. Functional expression of alkaline 
phosphatase appears to be difficult because at least 

15 14 proteins are necessary for the secretion of this 
protein {Kornacker et al., Mol. Mxcrol. , 1990, 4, 
1101-1109) . 

The IgA protease derived from Neisseria, a 
pathogenic microorganism possessing an interesting 

20 secretion system, has a secretion signal at the C- 
terminal beta-domain and this signal guides the N- 
terminal protease domain onto the cell surface. This 
protease is secreted into the culture medium by its 
own catalytic hydrolysis after being anchored on the 

25 outer membrane. Using the IgA protease beta-subunit, 
the 12kDa form of cholera toxin B-subunit was surface 
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expressed (Klauser et al., EMBO J., 1990, 9, 1991- 
1999) . However, the protein folding occurred in the 
periplasmic space during the transport prevents the 
secretion of the fusion protein. 
5 Proteins from cell surface organelles of Gram- 

negative bacteria, such as flagella, pili, and 
fimbriae, may be also available as surface anchoring 
motifs. For example, a flagellin, the subunit protein 
of flagellar filament, was used for the stable surface 

10 expression of cholera toxin B subunit and a B-type 
hepatitis viral peptide, and which were found to 
strongly react with their corresponding antibodies 
(Newton et al.. Science, 1989, 244, 70-72). When using 
fimbrilin, a fimbria subunit protein, as a surface 

15 anchoring motif, only small size peptides were 
successfully expressed (Hedegaard et al . , Gene, 1980, 
85, 115-124). 

Similar attempts have recently been developed in 
Gram-positive bacteria using surface proteins of Gram- 

20 positive and negative bacteria as surface anchoring 
motifs (Samuelson et al., J. Bacterial., 1995, 177, 
1470-1476) . A malaria blood stage antigen consisting 
of 80 amino acid residues and a albumin-associated 
protein from Streptococcus protein G were effectively 

25 expressed on the cell surface of Gram-positive 
bacteria using a surface expression system containing 
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the Staphylococcus aureus-derived Protein A as a 
surface anchoring motif and a secretion signal • from 
Staphylococcus hyicus-derived lipase. 

As a result of extensive research on surface 
5 expression in Gram-positive and Gram-negative bacteria, 
various surface expression systems are developed and 
patented in the U. S. A., Europe, and Japan. In the 
past three years, eight patents concerning surface 
expression systems have been issued, among which, the 

10 case using outer membrane proteins of Gram-negative 
bacteria is five (WO9504069, W09324636, WO9310214, 
EP603672 and US5356797) , using a pilus, a cell surface 
organelle, is one (WO9410330), and using a cell 
surface lipoprotein is one (WO9504079) . 

15 The most widely used bacterial host for foreign 

protein production is E. coll because it is easy to 
culture and its gene structure is well known. Foreign 
proteins, however, are not well secreted into culture 
media under ordinary conditions of E. coli. 

20 Additionally, when foreign proteins are excessively 
expressed, they are accumulated as inclusion bodies 
within the cell. Accordingly, the purification of them 
requires a refolding process for solubilizing the 
inclusion bodies, which results in a significant 

25 reduction in yield. Further, since E. coll produces 
endotoxins harmful to the hioman body, the recombinant 
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proteins may be contaminated with the toxins when 
being purified. 

In contrast, yeast has been studied as a host for 
producing useful foreign proteins by genetic 
5 engineering techniques because it can easily secrete 
proteins in active forms into the culture media under 
the control of its own intracellular secretion system, 
which is operated in a manner similar to that of 
higher eucaryotic organisms. 

10 Since the production of interferon in 1981 

(Hitzeman et al., 1981, nature, 293; 717-722), yeast 
has been extensively utilized for the production of 
foreign proteins. In addition, not only were 

recombinant proteins of yeast approved by the FDA of 

15 the U. S. A. for their safety to the human body, but 
also most of the regulatory mechanisms of gene 
expression in yeast are known (Strathern et al.. The 
Molecular Biology of the Yeast Saccharomyces , 
Metabolism and Gene Expression, Cold Spring Harbor 

20 Laboratory, N. Y., 1982). Accordingly, yeast system 
provides several significant advantages for the 
production of foreign proteins. For example, the 
proteins expressed in yeast are safe to the human body, 
and extracellularly secreted in conformations 

25 retaining high specific activity, similar to those 
expressed in animal or human cells. Furthermore, the 
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purification processes for the proteins produced from 
yeast is simple compared to E. coll, and requires no 
refolding processes to obtain active forms, thus 
showing high production yield. Particularly, the 
5 surface expression in S. cerevisiae has recently been 
under extensive study. A few years ago, studies on the 
surface expression of foreign proteins in yeast were 
mainly focused on a-agglutinin, a typical cell wall 
protein as a surface anchoring motif (Schreuder et al . , 

10 Yeast, 1993, 9, 399) . In recent years, the study about 
surface anchoring motif has been extended to various 
cell wall proteins. Above all, the screening of 
surface anchor proteins through the conserved sequence 
analysis is extensively achieved, as the genome 

15 project for S. cerevisiae has been completed (Hamada 
et al., Mol. Gen. Genet., 1998, 258, 53). Using such 
surface proteins as surface anchoring motifs, various 
enzymes, including a-galactosidase, glucoamylase, 
lipase, and cutinase have been stably expressed on the 

20 cell surface of S. cerevisiae. In addition, expression 
of various enzymes on cell surface could develop many 
useful industrial biocatalysts (Murai et al., Appl. 
Microbiol. Biotechnol., 1999, 51, 65). A surface 
expression system using a-agglutinin as an anchoring 

25 motif has been developed and commercialized by 
Invitrogen Corporation. Furthermore, yeast, which is a 

11 
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eucaryotic microorganism usually harmless to the human 
body, is highly useful as a host for producing 
proteins for use in food or medical materials. For 
instance, B-type hepatitis viral antigen (HbsAg) was 
5 expressed on the yeast cell surface with the aim of 
developing live vaccines (Schreuder et al . , Vaccine, 
1996, 14, 383) . 

As mentioned above, active research has been 
directed to cell wall proteins throughout the world, 

10 but limited to S. cerevxsiae. At present, researches 
on the cell wall proteins of other yeasts, such as 
Candida albicans, are in the initial stage. Therefore, 
there remains an urgent need for studies on surface 
proteins and surface expression thereof in 

15 industrially useful yeasts. Because the mediating 
proteins that have been studied thus far make use of 
glycosylphosphatidylinositol-anchor for surface 

anchoring, the proteins of interest must be linked to 
the anchor proteins at their carboxyl termini. However, 

20 some of the proteins may not exhibit their full 
activity under such a condition, and this is 
recognized as a drawback in yeast surface expression 
system. 

25 SUMMARY OF THE INVENTTOKI 
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Leading to the present invention, the intensive 
and thorough research on extracellular transport of 
exogenous proteins, conducted by the present inventors, 
resulted in the finding that surface proteins derived 
5 from Hansenula polymorpha, an industrially useful 
methanol -assimilating yeast, are highly effective in 
construction of surface expression systems and 
development of biocatalyst application systems. The 
surface expression system established in the present 

10 invention can stably express proteins of interest onto 
the cell surface, thus finding numerous applications 
in various fields, including immobilization of 
biocatalysts and large-scale production of proteins, 
such as enzymes, antigens, antibodies, etc. 

15 Therefore, it is an object of the present 

invention to resolve the foregoing or other problems 
encountered in prior arts and to provide systems for 
the exportation of exogenous polypeptides to discrete 
regions of a host cell in which they are expressed, 

20 and uses thereof. 

It is another object of the present invention to 
provide a novel surface anchor protein and its gene, 
derived from Hansenula polymorpha. 

It is a further object of the present invention 

25 to provide a surface expression system using all or 
part of the surface proteins as mediators for surface 
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expression of exogenous polypeptides or proteins. 

It is still a further object of the present 
invention to provide an expression mediator protein, 
isolated form Hansenula polymojrpha, which can be fused 
5 to the amino termini of proteins to be expressed on 
the cell surface, and a gene encoding the mediator 
protein. 

It is still another object of the present 
invention to provide a surface expression system in 

10 which all or part of the expression mediator protein 
which can be fused to the amino termini of proteins to 
be expressed, is employed. 

It is still another object of the present 
invention to provide a surface expression system in 

15 which all or part of the known surface protein derived 
from Saccharomyces dlastatlcus and Hansenula 
polymorpha , is employed. 

In accordance with an aspect of the present 
invention, there are provided the isolated DNA 

20 sequences, containing the base sequence HpSEDl 
represented by the SEQ. ID. NO: 1, the base sequence 
HpRIR2 represented by the SEQ. ID. NOr 4, the base 
sequence HpGASl represented by the SEQ. ID. NO: 5, the 
base sequence HpTIPl represented by the SEQ. ID. NO: 6, 

25 and the base sequence HpCWPl represented by the SEQ. 
ID. NO: 7, all encoding novel cell wall proteins of 
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Hansenula polymorpha, and their DNA homologues. 

In accordance with another aspect of the present 
invention, there are provided novel E. coll strains 
harboring recombinant vector containing a 4 kb EcoRI 
5 fragment of the base sequence HpSEDl, a 5.5 kb Sail 
fragment of the base sequence HpPIR2, a 3 kb PstI 
fragment of the base sequence HpGASl, a 3.5 kb Xbal 
fragment of the base sequence HpTIPl, and a 6 kb Sail 
fragment of the base sequence HpCWPl . 

10 In accordance with a further aspect of the 

present invention, there are provided surface 
expression systems in which base sequences of HpSEDl^ 
HpPIR2, HpGASl, HpTIPl, HpCWPl, STAl from 
Saccharomyces diastaticus, HpNSCl from Hansenula 

15 polymorpha and NSCl from Saccharomyces cerevisiae or 
their partial fragments are used to express mediator 
proteins which locate exogenous proteins onto the 
surface of a cell. Herein, useful in the present 
invention is a eukaryotic cell selected from the group 

20 consisting of yeast spp. , including Candida spp. , 
Debaryomyces spp., Hansenula spp., Kluyveromyces spp., 
Pichla spp., Schizosaccharomyces spp., Yarrowia spp., 
Saccharomyces spp. ; and mold spp. including 
Aspergillus spp. , Penicillium spp. , and Bhizopus spp. 

25 
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BRIEF DESCRIPTION OF THE DRAWTNnS 

The above and other objects, features and other 
advantages of the present invention will be more 
5 clearly understood from the following detailed 
description taken in conjunction with the accompanying 
drawings, in which: 

Fig . 1 is a photograph showing Southern blot 
analysis results of DNA fragments obtained by treating 
10 the genome of Hansenula polymorpha with BamHI (lane 3), 
EcoRI (lane 4), PstI (lane 5), and Xbal (lane 6), 
along with a EcoRI fragment of the Saccharomyces 
cerevisiae genome (lane 1) and a marker (lane 2) after 
the DNA fragments are hybridized with a SEDl gene as a 
15 probe . 

Fig. 2a is a photograph showing Southern blot 
analysis results of DNA fragments obtained by treating 
the genome of Hansenula polymorpha with Clal (lane 1) , 
EcoRI (lane 2), Hindlll (lane 3), Sail (lane 4) and 

20 Xhol (lane 6), along with a marker (lane 6) and an 
EcoRI fragment of the Saccharomyces cerevisiae (lane 
7) after the DNA fragments are hybridized with a CWP2 
gene as a probe. 

Fig. 2b is a photograph showing Southern blot 

25 analysis results of DNA fragments obtained by treating 
the genome of Hansenula polymorpha with BamHI (lane 1), 
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Clal (lane 2), EcoRI (lane 3), Hindlll (lane 4), PstI 
(lane 5), Xbal (lane 6), and Xhol (lane 7), along with 
a marker (lane 8) and an EcoRI fragment of 
Saccharomyces cerevisiae genome after the DNA 
5 fragments are hybridized with a TIRl gene as a probe. 

Fig. 3 is a photograph showing Southern blot 
analysis results of DNA fragments obtained by treating 
the genome of Hansenuia polymorpha with Clal (lane 1), 
Hindlll (lane 2), Sail (lane 3), Xhol (lane 4), BamHI 
10 (lane 7), EcoRI (lane 8), PstI (lane 9), and Xbal 
(lane 10), along with a marker (lane 5 and 6), after 
the DNA fragments are hybridized with a GASl gene as a 
probe . 

Fig. 4 is a photograph showing Southern blot 
15 analysis results of DNA fragments obtained by treating 
the genome of Hansenula polymorpha with BamHI (lane 1), 
EcoRI (lane 2), Clal (lane 3), PstI (lane 4), Xbal 
(lane 5), Xhol (lane 6), EcoRI (lane 9), Xbal (lane 
10), Xhol (lane 11), along with a marker (lanes 7 and 
20 12) and an EcoRI fragment of the Saccharomyces 
cerevisiae genome (lane 8), after the DNA fragments 
are hybridized with a TIPl gene as a probe. 

Fig. 5 is a photograph showing Southern blot 
analysis results of DNA fragments obtained by treating 
25 the genome of Hansenula polymorpha with Clal (lane 1), 
Hindlll (lane 2), PstI (lane 3), Sail (lane 4), and 
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Xhol (lane 5), after the DNA fragments are hybridized 
with a CWPl gene as a probe. 

Fig. 6 is a schematic diagram showing a surface 
expression vector for expressing CMCase on the cell 
5 surface of Hansenula polymorpha, which contains a 
GAPDH promoter (pGAPDH) , a killer toxin signal 
sequence (KTsig) and a CMCase gene with a termination 
codon (Term) . 

Fig. 7 is a histogram showing the CMCase activity 
10 detected on the cell surface of Hansenula polymorpha 
according to surface expression mediators. 

Fig. 8 shows graphs in which the CMCase activity 
detected on cell surfaces of Hansenula polymorpha 
strains, which have surface expression mediators is 
15 plotted with respect to culture time period. 

Fig. 9 is a schematic diagram showing a surface 
expression vector for expressing the protein HpPir2p 
on the cell surface of Hansenula polymorpha, which 
contains a GAPDH promoter (pGAPDH) , a killer toxin 
20 signal sequence (KTsig) and a CMCase gene with a 
termination codon (Term> , and a full length of a PIR2 
gene (Pir-N) fused to the amino terminus of the killer 
toxin signal sequence (KTsig) or a partial fragment of 
a PIR2 gene (Pir-C) fused to the carboxy terminus of 
25 the CMCase gene. 

Fig. 10a is a schematic diagram showing domains 
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of the protein Wsclp and their cellular location sites. 

Fig. 10b is a schematic diagram showing surface 
expression vectors for expressing exogenous proteins 
on the cell surface of Hansenula polymorpha, in which 
5 several truncated HpWSCl gene fragments, devoid of 
either the cystein motif or the transmembrane domain, 
and of both of them, and an intact HpWSCl gene 
encoding the full length of HpWsclp, are inserted. 

Fig. 11 is a photograph showing the glucose 
10 oxidase activity measured according to the plate 
activity assay method from Hansenula. polymorpha 
strains which use expression mediators (Tip40, GasF, 
CwpF and Wsc-ct) and no expression mediators (GOD*) . 

Fig. 12 shows FACS analysis results of Hansenula 
15 polymorpha strains (white) on which glucose oxidase is 
expressed by employing surface expression mediators 
such as CwpF (panel A) , Tip40 (panel B) , GasF (panel 
C) , and Wsc-ct (panel D) , along with a wild type 
(gray) . 

20 Fig. 13 is a schematic diagram showing surface 

expression systems for expressing exogenous proteins 
on the cell surface of Saccharomyces cerevisiae by use 
of the gene STAl, which are constructed with vectors 
comprising the signal sequence only (STASS) , both of 

25 the signal sequence and the octapeptide sequence 
(STASS-8), all of the signal sequence, the octapeptide 
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sequence and the threonine/serine-rich domain (STASS- 
8-TS) , both of the signal sequence and the 
threonine/serine-rich domain (STASS-TS), both of the 
signal sequence and alpha-agglutinin (STASS-CMC-Agalc) , 
5 and all of the signal sequence, the octapeptide, and 
alpha-agglutinin (STASS-8-CMC-Agalc) , along with a 
CMCase gene as a reporter gene. 

Fig. 14 is a photograph showing the CMCase 
activity measured according to the plate activity 

10 assay method from Saccharomyces cerevisiae strains 
which harbor the signal sequence only (SS) , both of 
the signal sequence and the octapeptide sequence (SS- 
8), all of the signal sequence, the octapeptide 
sequence and the threonine/serine-rich domain {SS-8- 

15 TS), and both of the signal sequence and the 
threonine/serine-rich domain (SS-TS) . 

Fig. 15 shows FACS analysis results of 
Saccharomyces cerevisiae strains on which CMCase is 
expressed by employing surface expression vectors 

20 (gray), along with a wild type (white). 

DETAILED DESCRIPTION OF THE INVENTION 

Established in the present invention are systems 
25 for the export of exogenous polypeptides to discrete 
regions of a yeast cell in which it is expressed, 

20 
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which can be used for regenerable whole-cell 
biocatalyst. 

To circumvent the problems associated with 
Saccharomyces cerevisiae, which has been used for the 
5 industrial production of recombinant proteins, 
Hansenuia polymorpha, a methylotrophic yeast like 
Pichia pastoris, is studied for the expression of 
recombinant proteins. Because the cell wall structure 
of Hansenula polymorpha is not yet known, genes, which 

10 encode proteins mediating the exportation of exogenous 
proteins to the cell surface, are screened and cloned 
in the present invention. In addition, the cloned 
genes are fused to genes coding for proteins of 
interest to construct novel surface expression systems 

15 capable of expressing exogenous proteins on yeast cell 
surfaces . 

Therefore, the present invention pertains to 
genes encoding cell wall proteins of Hansenula 
polymorpha . 

20 Cell wall proteins of Hansenula polymorpha were 

identified to be very similar in characteristics to 
those of Saccharomyces cerevisiae. Genes coding for 
Cwplp (GenBank D37975, van der Vaart et al., J. 
Bacterid., 1995, 177, 3104), Cwp2p (GenBank Z28096, 

25 Van der Vaart et al . , J. Bacterial. ^ 1995, 177, 3104), 
Sedlp (GenBank X66838, Seidel, J. and W. Tanner, Yeast, 

21 
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1997, 13, 3104)/ Tirlp (GenBank X12775) , and Tiplp 
(GenBank M71216, Van der Vaart et al . , J. Bacterial., 
1995, 177, 3104), all known as cell wall proteins, and 
Gaslp (GenBank X53424, Benghezal et al., J. Cell Biol., 
■ 5 1995, 130, 1333) known as a cytoplasmic membrane 
protein, were used to search for corresponding ones 
from the genome of Hansenula polymorpha. 

(1) Isolation of HpSEDl Gene of Hansenula 

10 polymorpha 

Encoded by the gene SEDl , 1,017 bp long, Sedlp is 
a protein which consists of 338 amino acid residues 
and a predominant protein on the cell wall of 
Saccharomyces cerevisiae in the stationary phase. 

15 Using a SEDl gene segment of S. cerevisiae as a 

probe, a 4 kb EcoRI fragment was obtained from the 
genome of Hansenula polymorpha (see Fig. 1) and 
inserted into the vector. The SEDl homologous gene 
fragment of Hansenula polymorpha named as HpSBDlr was 

20 identified to have the base sequence represented by 
the SEQ. ID. NO: 1, which codes for an open reading 
frame consisting of 131 amino acid residues 
represented by the SEQ. ID. NO: 11.- This HpSEDl gene 
fragment of Hansenula polymorpha was much shorter than 

25 the SEDl gene of S. cerevisiae. However, the HpSedlp 
encoded by the HpSEDl gene of Hansenula polymorpha has 
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the same amino acid repeating sequences, shows a 
similar threonine/serine-rich structure, and shares 
homology of 58.4% in amino acids sequence with S. 
cerevisiae. In addition, the HpSedlp of Hansenula 
5 polymorphs was seemed to have a signal sequence 
consisting of 17 amino acids, and a putative GPI- 
anchor signal at its carboxyl terminus. A novel E. 
coli transformant harboring a recombinant vector 
containing the 4 kb EcoRI fragment of the HpSEDl gene 
10 was deposited with the Korean Collection for Type 
Culture of Korea Research Institute of Bioscience and 
Biotechnology (KRIBB) under the accession No. KCTC 
0825BP on July 11, 2000. 

15 (2) Isolation of HpPIR2 Gene of Hansenula 

polymorpha 

Encoded by the TIRl gene, 7 65 bp long, Tirlp is a 
protein found at the cell well of S. cerevisiae, 
consisting of 254 amino acids. The protein shows hyper 

20 N-glycosylation and is rich in serine/alanine and well 
expressed under the anaerobic condition. 

Using a TIRl gene fragment of S. cerevisiae as a 
probe, a 5 kb Clal fragment was obtained from the 
genome of Hansenula polymorpha (see Fig. 2b) and 

25 inserted into a vector. DNA sequence analysis revealed 
that the ORF in the Clal fragment shares homology of 
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51.2% with the PIR2 gene of S. cerevisiae (GenBank 
D13741, Toh-e et al., Yeast, 1993, 9, 481). 

Cwp2p, consisting of 92 amino acids, is known as 
a most effective surface anchor except a-agglutinin, 
5 among the proteins of S. cerevisiae (van der Vaart et 
al., Appl. Environ. Microbiol., 1997, 63, 615). In the 
present invention, a 5.5 kb Sail fragment obtained 
from the genome of Hansenula polymorpha using a S. 
cerevisiae CWP2 gene as a probe (see Fig. 2a), was 

10 identified as the same fragment with the Clal fragment 
cloned by TIRl by DNA sequencing. Both proteins, Cwp2p 
and Tirlp, were reported to contain the repeating 
amino acid sequences represented by the SEQ. ID. NO: 2, 
called PIRl/2/3 repeat which made strong homology with 

15 Pir2p protein. 

The Pir2p of S. cerevisiae coded by PIR2 is known 
to be released from the cell wall by alkali treatment, 
usually have a Kex2 cleavage site, also this protein 
is anchored on the cell wall in a form different from 

20 that of the GPI (glycosylphosphatidylinositol) -anchor 
protein and is not released by glucanase treatment 
(Toh-e et al.. Yeast, 1993, 9, 481). The N-terminal 
amino acids analysis of major cell wall protein 
released by alkali treatment from Hansenula polymorpha 

25 cell wall, showed the same amino acids sequence with 
the HpPir2p polypeptide following the kex2p cleavage 
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site/ represented by the SEQ. ID. NO: 3. Thus, the 
HpPIR2 gene obtained from Hansenula polymorpha was 
expected to be a gene coding for the HpPir2p, which 
located in the cell wall. The HpPir2p of Hansenula 
5 polymorpha is encoded by a 1,014 bp gene represented 
by the SEQ. ID. NO: 4, consisting of 337 amino acids 
with PIRl/2/3 repeat. This protein possessed a 
secretion signal sequence consisting of 18 amino acids 
and a Kex2p cleavage site at amino acid residue 68 . A 

10 novel E. coli transformant harboring a recombinant 
vector containing the 5.5 kb Sail fragment of the 
HpPIR2 gene of Hansenula polymorpha was deposited with 
the Korean Collection for Type Culture of Korea 
Research Institute of Bioscience and Biotechnology 

15 (KRIBB) under the accession No. KCTC 0827BP on July 11, 
2000. 

(3) Isolation of HpGASl gene of Hansenula 
polymorpha 

20 Encoded by the GASl gene, 1, 680 bp long, Gaslp 

consists of 559 amino acids and is linked to the cell 
membrane of Saccharomyces cerevisiae. However, its 
amino terminal domain is exposed to the exterior, 
penetrating the cell wall. 

25 In the present invention, using a GASl gene 

fragment of S. cerevisiae as a probe, a 3 kb PstI 
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fragment was obtained from the genome of Hansenula 
polymorpha (see Fig. 3) and inserted into a vector. 
DNA sequence analysis revealed an ORF shares 70.7% 
homology with the Gaslp of S. cerevisiae and this was 
5 named as HpGaslp. HpGaslp of Hansenula polymorpha 
consists of 537 amino acids represented by the SEQ. ID. 
NO: 14, which is encoded by the gene represented by 
the SEQ. ID. NO: 5, 1,614 bp long. The Gaslp of 
Hansenula polymorpha also possesses a secretion signal 

10 sequence consisting of 18 amino acids, but GPI-anchor 
signal was different with that of S. cerevisiae. A 
novel E. coli transformant harboring a recombinant 
vector containing 3 Jcb PstI fragment of the HpGASl 
gene of Hansenula polymorpha was deposited with the 

15 Korean Collection for Type Culture of Korea Research 
Institute of Bioscience and Biotechnology (KRIBB) 
under the accession No. KCTC 0828BP on July 11, 2000. 

(4) Isolation of HpTIPl Gene of Hansenula 
20 polymorpha 

Encoded by the 633 bp, TIPl gene of Saccharomyces 
cerevisiae, Tiplp is a GPI-anchor protein consisting 
of 210 amino acids with high homology with Tirlp and 
the expression of Tiplp is induced by cold shock. 
25 In the present invention, using a TIPl gene 

fragment of S. cerevisiae as a probe, a 3.5 kb Xbal 
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fragment was obtained from the genome of Hansenula 
polymorpha (see Fig. 4) and inserted into a vector. 
DNA sequence analysis revealed an ORF consisting with 
852 bp sequence represented by the SEQ. ID. NO: 6, 
5 coding for 283 amino acids represented by the SEQ. ID. 
NO: 13. A novel E. coll transformant harboring a 
recombinant vector containing the 3.5 kb Xbal fragment 
of the HpTIPl gene of Hansenula polymorpha was 
deposited with the Korean Collection for Type Culture 
10 of Korea Research Institute of Bioscience and 
Biotechnology (KRIBB) under the accession No. KCTC 
0824BP on July 11, 2000. 

(5) Isolation of HpCffPl Gene of Hansenula 
15 polymorpha 

Cwplp is a typical GPI-anchor protein, which is 
encoded by the 720 bp, CWPl gene of Saccharomyces 
cerevisiae, consisting of 239 amino acids. 

In the present invention, using the full CWPl 
20 gene of S. cerevisiae, a 6 kb Sail fragment was 
obtained from the genome of Hansenula polymorpha (see 
Fig. 5) and cloned into a vector. DNA sequence 
analysis revealed a putative ORF with 24 6 bp sequence 
represented by the SEQ. ID. NO: 7, which was named as 
25 HpCWPl, even though this protein showed low homology 
with the CWPl of 5. cerevisiae. Additionally, the 
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HpCwplp represented by the SEQ. ID. NO: 15, was 
deduced to have a secretion signal consisting of 15 
amino acids, also the putative GPI -anchor signal. A 
novel E. coli transf ormant harboring a recombinant 
5 vector containing 6 kb Sail gene fragment coding for 
the HpCwplp of Hansenula. polymorpha was deposited with 
the Korean Collection for Type Culture of Korea 
Research Institute of Bioscience and Biotechnology 
(KRIBB) under the accession No. KCTC 0826BP on July 11, 
10 2000. 

(6) Isolation of HpWSCl Gene of Hansenula 
polymorpha 

Wsclp is a protein responsible for stress 
15 response, anchored in the cell membrane of 
Saccharomyces cerevisiae. This stress response protein 
was reported to have a carboxy-terminal transmembrane 
domain, a serine/threonine-rich domain traversing the 
cell wall, and an amino-terminal cystein motif which 
20 is exposed to the exterior, functioning to detect 
external signals (Verna et al . , Proc. Natl. Acad. Sci. 
USA, 1997, 94, 13804). 

Based on the fact that fluorescence was observed 
on the cell surface when a green fluorescence protein 
25 (GFP) was expressed as a fusion partner with Wsclp, it 
was deduced that the amino terminal domain of Wsclp 
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could be used as a surface-expression mediator. In the 
process of isolating the LEU2 gene of Hansenula 
polymorpha, the HpWSCl gene was obtained by Agaphonov 
et al. (Agaphonov et al . , i^east, 1994, 10, 509, 
5 GenBank U00889) . DNA sequencing analysis disclosed 
that the HpWSCl gene of Hansenula polymorpha consists 
of the 1,110 bp sequence represented by the SEQ. ID. 
NO: 8 encoding the 37 3 amino acid sequence. The 
protein encoded by the HpWSCl gene isolated from 

10 Hansenula polymorpha was assumed as a cell surface 
protein different in anchoring mechanism from GPI- 
anchor proteins and could be used to develop surface 
expression systems by fusion to the amino terminal 
domain of foreign proteins . 

15 In accordance with the embodiment of the present 

invention, there are provided Hansenula polymorpha- 
derived cell wall protein genes, HpSEDl , HpPIR2 , 
HpGASlr HpTIPl, and HpCfVPI, and a membrane protein 
gene HpWSCl gene, vectors containing them, and cells 

20 transformed with the vectors. 

Also, the present invention pertains to surface 
expression systems using novel cell wall proteins 
isolated from Hansenula polymorpha as mediators for 
surface expression of foreign proteins. 

25 In accordance with another embodiment of the 

present invention, there are provided surface 

29 
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expression systems using GPI-anchor proteins isolated 
from Hansenula polymorpha, as mediators for surface 
expression of foreign proteins - 

Expected to be useful as mediator genes for 
5 surface expression systems, four putative GPI-anchor 
protein genes are cloned from Hansenula polymorpha. 
Along with each of the mediators, carboxyl methyl 
cellulase (hereinafter referred to as "CMCase") 
derived from Bacillus suhtills (Park et al . , Agric. 

10 Biol. Chem., 1991, 55, 441) is used in the surface 
expression systems as a reporter protein, in 
accordance with the present invention . 

In this regard, a GAPDH promoter (Sohn et al., 
Appi. Microbiol. Biotechnol., 1999, 51, 800) and a 

15 killer toxin signal sequence (Sor, F. and Fukuhara, 
Curr. Genet., 1985, 9, 147) were inserted into an 
expression vector. Then, the CMCase gene was fused in 
frame to construct a CMCase surface expression vector, 
which can express CMCase under the control of the 

20 GAPDH promoter. Four putative GPI-anchor protein genes, 
HpCWPl, HpGASlr HpTIPl and HpSEDl gene fragments, are 
individually inserted to the expression vector at a 
carboxy terminal region of the CMCase to construct a 
CMCase surface expression vector. Based on the fact 

25 that a GPI-anchor motif located within the 40 amino 
acids of carboxy terminus (van der Vaart et al., Appl. 
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Environ. Microbiol., 1997, 63, 615), CMCase surface 
expression vectors were also constructed with a 
nucleic acid segment encoding the carboxy-terminal 40 
amino acids of each mediators . After all of the 
5 surface expression vectors were introduced to 
Hansenula polymorpha (Hill et al., Nucl. Acids Res., 
1991, 19, 5791), the transf ormants thus obtained were 
cultured and the CMCase activity of them was measured 
in the supernatant and the whole cell fraction. 

10 In whole cell fractions, the expression vectors 

in which each of the mediator genes was inserted 
exhibit significantly improved CMCase activity 
compared to control vectors in which the signal 
sequence is absent or no mediator genes are fused, 

15 demonstrating that CMCase is exported to the cell 
surface by each mediator (see Table 1) . Especially, 
HpTiplp, HpGaslp and HpCwplp were found to translocate 
CMCase to the cell surface at higher efficiency than 
other mediators (see Fig. 7) . 

20 While the CMCase activity was decreased in most 

of the cells when they reached the stationary phase, 
the CMCase activity of the cells using HpCwplp as a 
surface anchor were found to remain high throughout 
the culturing period, although being lowered in the 

25 stationary phase. Therefore, HpCwplp was inferred to 
be the most stable mediator among the proteins tested 
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(see Fig. 8) . 

In accordance with a further embodiment of the 
present invention, there are provided surface 
expression systems using non GPI-anchor proteins 
5 isolated from Hansenuia polymorpha as mediators for 
surface expression of foreign proteins. 

In one version of this embodiment, an HpPIR2 gene, 
which encodes a cell wall protein different from GPI- 
anchor proteins were used for constructing a surface 
10 expression system. 

In contrast to GPI-anchor proteins, the protein 
Pir2p is anchored in the cell wall through the hyper 
O-glycosylation of its serine/threonine-rich domain 
with the glycans present in the cell wall. Thus, the 
15 protein Pir2p is not released from the cell wall by 
the enzymes, which hydrolyze glycans only, but 
released from the cell wall by alkaline treatment, 
which cleaves all glycosylation . In the present 
invention, surface expression systems utilizing 
20 carboxy- and amino-terminal regions of the protein 
HpPir2p were constructed. To evaluate the anchoring 
ability of the mediators derived from HpPir2p, CMCase, 
the reporter, activity was measured (see Fig. 9) , 

Almost the same CMCase activity was obtained when 
25 the carboxy-terminal region of the protein HpPir2p was 
expressed as a mediator as when CMCase gene without 
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mediator was expressed. By contrast, where the amino- 
terminal region of the protein HpPir2p was used as an 
anchoring mediator, CMCase activity was detected not 
only in the supernatant, but also in the whole cell 
5 fraction (see Table 2) . It was reported that the 
protein Pir2p was secreted to the culture medium by 
the overexpression thereof in vivo (Russo et al., Proc. 
Natl. Acad. Sci. USA, 1992, 89, 3671). Therefore, the 
CMCase activity detected in the whole cell fraction 

10 was attributed by the enzyme anchored in the cell wall . 

Therefore, the protein HpPir2p can be anchored in 
the cell wall and be used as a surface expression 
mediator. Over conventional GPI-anchor proteins, the 
protein Pir2p has the advantage of linking to an amino 

15 terminal of target protein to its carboxy terminus. 

In another version of the embodiment, the WSCl 
gene, which encodes a transmembrane protein different 
from GPI-anchor proteins, was used to construct a 
surface expression system. 

20 The protein Wsclp is composed of a transmembrane 

domain by which the protein is anchored in the cell 
membrane, a serine /threonine-rich domain that 
traverses the cell wall, and an amino-terminal cystein 
motif which exists in the extracellular space (Verna 

25 et al., Proc. Natl. Sci. USA, 1997, 13804), as 
illustrated in Fig. 10a. 
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In order to establish a surface expression system 
using the HpWsclp, several truncated HpWSCl gene 
fragments, devoid of either the cystein motif or the 
transmembrane domain and of both of them were inserted 
5 in the CMCase expression vector (see Fig. 10b) and the 
CMCase activity was analyzed. 

Only where the transmembrane domain was presented 
in the fusion protein, the CMCase was not secreted, 
but rather attached to the cell. However, although the 

10 transmembrane domain was present, CMCase activity was 
not detected in the whole cell fraction unless the 
cystein motif is present. This result might come from 
the fact that the CMCase was not sufficiently exposed 
on the cell surface. In contrast, where the 

15 transmembrane domain was absent, the CMCase activity 
was mostly detected in the supernatant. In addition, 
the cystein motif made an additional contribution to 
the secretion of the CMCase (see Table 3) . 

Therefore, the mediator derived from the HpWsclp 

20 must have the cystein motif and the transmembrane 
domain, both, for anchoring efficiency. In the present 
invention, the HpWsclp was suggested as a novel 
mediator for surface expression of proteins . 

In accordance with still a further embodiment of 

25 the present invention, there are provided surface 
expression systems for exporting glucose oxidase to 

34 
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the cell surface by use of the HpTiplp, HpGaslp and 
HpCwplp, and the HpWsclp. 

Glucose oxidase, a flavoenzyme derived from 
Aspergillus niger, consists of two identical 
5 polypeptide chain subunits. Upon being expressed in 
Hansenula polymorpha, this enzyme is known to undergo 
hyper giycosylation . 

When being expressed in the presence of the 
anchoring mediators such as HpTiplp, HpGaslp, HpCwplp 

10 and HpWsclp, glucose oxidase was found to form smaller 
activity circles on plates (Hodgkins et al . , i^east, 
1993, 9, 625) than the secreted ones (see Fig. 11). 

In all cases of using the anchoring mediators, 
fluorescent cells were observed by FACS (fluorescence 

15 activated cell sorter) analysis, indicating that the 
glucose oxidase was expressed and anchored onto the 
cell surface. Additionally, higher glucose oxidase 
activity was detected in the whole cell fraction than 
in the supernatant throughout all of strains. When 

20 HpCwplp was employed, fluorescence was detected from a 
large number of cells, demonstrating that HpCwplp had 
an excellent cell wall anchoring activity. It was also 
found that the 40 amino acids fragment of HpTiplp was 
efficient in expression of proteins onto the cell 

25 surface (see Fig. 12) . Under the mediation of HpGaslp 
and HpWsclp, fluorescence, which proves the cell wall 
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anchoring activity, was detected, but in a low level. 
However, even in this case, there seemed to be no 
problems in detecting the activity of the enzyme if 
the substrate penetrated the cell wall. 
5 Accordingly, not only is HpCwplp found to be the 

most effective surface expression mediator among the 
GPI-anchor proteins, but also the proteins HpPir2p and 
HpWsclp can be suggested as novel expression mediators 
in accordance with the present invention. 

10 Herein, useful in the present invention is a 

eukaryotic cell selected from the group consisting of 
yeast spp., including Candida spp., Debaryomyces spp., 
Hansenula spp., Kluyveromyces spp., Pichia spp., 
Schlzosaccharomyces spp., Yarrowia spp,, Saccharomyces 

15 spp,; and mold spp. including Aspergillus spp., 
Penicillium spp., and Rhizopus spp. 

In accordance with still another embodiment of 
the present invention, there are provided surface 
20 expression systems using the Saccharomyces 
diastaticus-derived glucoamylase gene STAl. 

With the 2,337 bp sequence represented by the SEQ. 
ID, No: 9, the glucoamylase derived from Saccharomyces 
diastaticus is secreted to extracellular spaces to 
25 hydrolyze extracellular starch. The amino acid 
sequence analysis of this enzyme showed a signal 
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sequence (SS) in the amino-terminal region and 
threonine/serine-rich (TS-rich) domain which 

containing the threonine and serine residues in an 
amount of 55% of the total amino acid residues. The 
5 signal sequence and the threonine/serine-rich domain 
are linked to each other via the octapeptide 
represented by the SEQ. ID. NO: 10, While this 
octapeptide plays a role in directing the protein to 
the exterior, the threonine/serine-rich domain is 

10 responsible for the support of the proteins upon its 
penetration through the cell wall, thus acting like an 
anchoring mediator (Venturini et al . , Mol. Microbiol., 
1997, 23, 997/ Yamashita, I., Agric. Biol. Chem. , 1989, 
53, 4 83; Yamashita et al . , Agric. Biol. Chem., 1984, 

15 48, 1611) . 

With reference to Fig. 13, there are shown vector 
diagrams for the surface expression systems using the 
STAl gene. As depicted, the surface expression systems 
are constructed with vectors containing the signal 

20 sequence only, both of the signal sequence and the 
octapeptide sequence, all of the signal sequence, the 
octapeptide sequence and the threonine/serine-rich 
domain, both of the signal sequence and alpha- 
agglutinin, and all of the signal sequence, the 

25 octapeptide, and alpha-agglutinin, along with a CMCase 
gene as a reporter gene. 
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When employing the signal sequence only, most of 
the CMCase was secreted extracellularly, not anchored 
in the cell wall, as measured by the pNPC (p- 
nitrophenyl p-D-cellobiocide) method (Deshpande et al., 
5 Anal. Biochem. , 1984, 138, 481). In contrast, when 
employing the threonine/serine-rich domain linked to 
the signal sequence via the octapeptide sequence, the 
CMCase activity was low in the supernatant, but high 
in the whole cell fraction. The same results were 

10 confirmed by plate activity assay. Large circles were 
formed around the colonies, which harbor the signal 
sequence only, indicating that the CMCase was mostly 
secreted outside the cells . The presence of the 
threonine/serine-rich domain hindered the enzyme 

15 secretion, as proven by the small circles formed 
around the colonies (see Table 4 and Fig. 14) . 

In addition, fluorescence was detected in the 
cells harboring the threonine/serine-rich domain by 
FACS analysis, suggesting the presence of CMCase on 

20 the cell surface (see Fig. 15) . 

When glucoamylase gene was used for the 
construction of surface expression systems, 
glucoamylase or its domains could be fused to the 
amino terminus of target protein. Thus, these systems 

25 could circumvent the problems arising when a carboxy- 
terminal active domain of the target protein was 
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interrupted by the mediator fused to the carboxy 
terminus . 

Therefore, the mediators described herein allow 

the development of surface expression systems capable 

5 of expressing various types of target proteins, 
according to the characteristics of ■ the target 
proteins . 



EXAMPLES 

10 Practical and presently preferred embodiments of 

the present invention are illustrative as shown in the 

following Examples . 

However, it will be appreciated that those skilled 

in the art, on consideration of this disclosure, may 
15 make modifications and improvements within the spirit 

and scope of the present invention. 



EXAMPLE Ij Isolation of Gen^^s F. ncodinn Cell Wall 

Expression Proteins from H^ nsenula polvmnrpha 

20 For use in the cloning, the strain DLl-L (A Ieu2) , 

derived from Hansenula polymorpha DLl (ATCC 26012) was 
obtained from NPO Biotechnologia (Moscow, Russia) . 
Cell wall protein profile analysis by biotin labeling, 
which cannot penetrate into the cell wall, 

25 demonstrated that this strain had very similar profile 
of surface proteins with Saccharomyces cerevisiae. For 
39 
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the cloning of cell wall protein genes from Hansenula 
polymorpha, the genes encoding the glucanase 
extractable cell wall proteins of S. cerevisiae were 
utilized. 

5 In order to isolate the cell wall protein gene 

from Hansenula polymo2:pha , genes coding for Cwplp 
(GenBank D37975, van der Vaart et al,, J. Bacterial. , 
1995, 177, 3104), ,Cwp2p (GenBank Z28096, van der Vaart 
et al., J. Bacteriol. , 1995, 177, 3104), Sedlp 

10 {GenBank X66838, Seidel, J. and W. Tanner, Yeast, 1997, 
13, 3104), Tirlp (GenBank X12775) , and Tiplp (GenBank 
M71216, van der Vaart et al., J. Bacteriol. , 1995, 177, 
3104), all known as cell wall expression proteins, and 
Gaslp (GenBank X53424, Benghezal et al . , J. Cell Biol., 

15 1995, 130, 1333) known as a cytoplasmic membrane 
protein, were obtained from S. cerevisiae. Reportedly, 
these proteins can be separated from the cell wall by 
glucanase treatment because their carboxy termini are 
linked to (1-1, 6-glucan of the cell wall via GPI-anchor 

20 (Kapteyn et al . , Biochim. Biophys . Act., 1999, 1426, 
373; Kolla et al . , J. Biol. Chem., 1997, 272, 17762). 
Using these genes as probes, the genome of Hansenula 
polymorpha was subjected to southern blot analysis. No 
clear signals were obtained from the genomic D^5A of 

25 Hansenula polymorpha when the Cwplp gene of S. 
cerevisiae was used as a probe. This was inferred to 
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result from low homology of the gene between the two 
strains. In the cases of the other four genes, clear 
signals were observed. From the genomic library, 
corresponding DNA fragments were cloned. 

5 

<1-1> Isolation of HpSEDl gene of Hansenula polymoxrpha 

In order to isolate the SEDl homologue from 
Hansenula polymorphs, the open reiading frame of the 
Saccharomyces cerevisiae SEDl gene was labeled 

10 according to the Klenow fragment method by a DIG 
labeling system (Boehringer Mannheim) and this was 
used as a probe for Southern blot analysis. Following 
hybridization at 42°C in a 30% formamide solution, 
signal was detected. 

15 The southern blot analysis results are given in 

Fig. 1. From the EcoRI fragment, a signal was detected 
at a size of about 4 kb. After the elution of DNA 
around 4 kb from the gel, the DNA fragment was 
inserted to pBluescript II SK( + ) at a EcoRI site to 

20 construct a library. Under the same condition, 
southern blot analysis was conducted with the library 
to obtain a recombinant vector in which a DNA fragment 
hybridized with the probe was inserted. The DNA 
sequence analysis of the recombinant vector, named 

25 HpSEDl r revealed that the 4 kb DNA fragment has a base 
sequence represented by the SEQ. ID. NO: 1 with an 
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open reading frame consisting of 131 amino acids 
represented by the SEQ. ID. NO: 11. The HpSEDl gene of 
Hansenula polymorpha was found to be far shorter than, 
but have the same amino acid repeating sequences as 
5 the corresponding gene of S. cerevisiae and show a 
similar threonine/serine-rich structure, sharing 
homology of 58.4% with Sedlp of 5. cerevisiae. In 
addition, the HpSedlp of Hansenula polymorpha was 
expected to have a signal sequence consisting of 17 

10 amino acids, and a GPI-anchor signal at its carboxyl 
terminus, as analyzed by the PSORT II prediction 
program. A novel E. coll transformant harboring a 
recombinant vector in which the 4 kb EcoRl fragment 
containing the HpSEDl gene of Hansenula polymorpha is 

15 inserted, was deposited with the Korean Collection for 
Type Culture of Korea Research Institute of Bioscience 
and Biotechnology (KRIBB) under the accession No. KCTC 
0825BP on July 11, 2000. 

20 <l-2> Isolation of HpPIR2 gene of Hansenula polymorpha 
In order to isolate the TIRl homologue from 
Hansenula polymorpha, the open reading frame of the 
Saccharomyces cerevisiae TIRl gene was labeled 
according to the Klenow fragment method by a DIG 

25 labeling system (Boehringer Mannheim) and this 
fragment was used as a probe for southern blot 
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analysis. Following hybridization at 42"'C in a 30% 
formamide solution, signal was detected. 

The southern blot analysis results are given in 
Fig. 2b. From the Clal digested fragment, a signal was 
5 detected at about 5kb (lane 9) . After the elution of 
this part from the gel, the DNA fragment was inserted 
to pBluescript II SK( + ) at a Clal site to construct a 
recombinant vector library. Using the same method as 
<1-1>, a recombinant vector containing a 5 kb Clal DNA 

10 fragment hybridized with the probe was obtained. The 
DNA sequence analysis of this 5 kb Clal DNA fragment, 
revealed an ORF that the shares homology of 51.2% with 
the Pir2p of S. cerevisiae. 

To clone the CWP2 homologue from H. polymorpha, 

15 the open reading frame of the S. cerevisiae CNP2 gene 
was labeled according to the Klenow fragment method by 
a DIG labeling system and used as a probe. Southern 
hybridization was conducted at 42°C in a 30% formamide 
solution to detect signals. The southern blot analysis 

20 results are shown in Fig. 2a. From a 5.5 kb Sail 
fragment, an apparent signal was detected. This 5.5 kb 
Sail fragment was identified to have the same DNA 
sequence with the Clal fragment, as analyzed by DNA 
sequencing. Both of the proteins, Cwp2p and Tirlp, 

25 were found to contain the repeating amino acid 
sequence represented by the SEQ. ID. NO: 2, called 
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PIRl/2/3 repeat. It was inferred that these repeating 
sequences might share high homology with the HpPIR2 
gene of Hansenula polymorpha, thus resulting in the 
hybridization of the HpPIR2 gene with the probe. To 
5 exclude this possibility, each gene deprived of the 
repeating sequences was used as a probe for southern 
blot analysis, but no clear signals were observed. 

Encoded by the PIR2 gene isolated from S. 
cerevisiae, the Pir2p protein was reported to be 

10 released from the cell wall by alkali treatment and 
not released by glucanase treatment, which meant that 
this protein anchored onto the cell wall in a 
different form from that of the GPI (glycosyl- 
phosphatidylinositol) -anchor protein, also it was 

15 known that this protein had a Kex2 cleavage site (Toh- 
e et al.. Yeast, 1993, 9, 481). The N-terminal amino 
acids analysis of major cell wall protein released by 
by 30 mM sodium hydroxide (NaOH) from Hansenula 
polymojrpha cell wall (Mrsa et al.. Yeast, 13, 1145- 

20 1154, 1997), showed the same amino acids sequence with 
the HpPir2p polypeptide following the kex2 cleavage 
site, represented by the SEQ. ID. NO: 3. Thus, the 
gene HpPIR2 obtained from Hansenula polymorpha was 
inferred to code for the protein HpPir2p, which 

25 translocates to the cell wall. 

The Pir2p protein of Hansenula polymorpha was 

44 
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encoded by a 1,014 bp gene, represented by the SEQ. ID. 
NO: 4, consisting of 337 amino acids with PIRl/2/3 
repeats, represented by the SEQ. ID. NO: 12. This 
protein possesses a secretion signal sequence 
5 consisting of 18 amino acids and a Kex2 cleavage site 
at amino acid residue 68 . A novel E. coli transf ormant 
transformed with a recombinant vector in which the 5.5 
kb Sail fragment of the HpPIR2 gene of Hansenula 
polymorpha is inserted was deposited with the Korean 
10 Collection for Type Culture of Korea Research 
Institute of Bioscience and Biotechnology (KRIBB) 
under the accession No. KCTC 0827BP on July 11, 2000. 

<l-3> Isolation of HpGASl gene of Hansenula polymorpha 
15 S. cerevisiae GASl gene was labeled by the Klenow 
fragment method with the aid of a DIG labeling system 
(Boehringer Mannheim) and used as a probe for southern 
blot analysis. Following hybridization at 42°C in a 
30% formamide solution, signal was detected. 
20 From the Sail fragment, a signal was detected at 

a size of about 1.6 kb. To identify the full length of 
the gene, the genome was treated with other 
restriction enzyme and hybridized with the same probe 
at 42"»C in 20% formamide. As shown in Fig. 3, a signal 
25 was detected at a 3kb PstI fragment. This PstI 
fragment was cloned and inserted to pBluescript II 
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SK(+) at a PstI site to construct a recombinant vector, 
named HpGASl. The DNA sequence analysis of the 
recombinant vector revealed that the HpGaslp of 
Hansenula polymorpha, encoded by the gene represented 
5 by the SEQ. ID. NO: 5, 1,614 bp long, consisted of 537 
amino acids represented by the SEQ. ID. NO: 14, and 
shared homology of as high as 70.7% with the Gaslp of 
S. cerevisiae. The HpGaslp of Hansenula polymorpha was 
also found to possess a secretion signal sequence 

10 consisting of 18 amino acids, but GPI-anchor signal 
was different with that of S. cerevisiae. 

A novel E. coll transformant harboring a 
recombinant vector in which the 3 kb PstI fragment 
containing HpGASl gene of Hansenula polymorpha was 

15 deposited with the Korean Collection for Type Culture 
of Korea Research Institute of Bioscience and 
Biotechnology (KRIBB) under the accession No. KCTC 
0828BP on July 11, 2000. 

20 <l-4> Isolation of HpTIPl gene of Hansenula polymorpha 
The open reading frame of the Saccharomyces 
cerevisiae TIPl gene was labeled according to the 
Klenow fragment method by a DIG labeling system 
(Boehringer Mannheim) and used as a probe for southern 

25 blot analysis for screening the TIPl homologue from 
Hansenula polymorpha. Following hybridization at 42'*C 
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in a 30% formamide solution, southern blot signal was 
detected. 

The southern blot analysis results are given in 
Fig. 4. At a 3.5 kb Xbal fragment, an apparent signal 
5 was detected. This 3.5 kb fragment was inserted to 
pBluescript II SK( + ) at a Xbal site to construct a 
recombinant vector, named HpTIPl. The DNA sequence 
analysis of the recombinant vector revealed that the 
cloned gene had an 852 bp sequence represented by the 

10 SEQ. ID. NO: 6, coding for 283 amino acids represented 
by the SEQ. ID. NO: 13. A novel E. coli transformant 
harboring a recombinant vector containing the 3.5 kb 
Xbal fragment of the HpTIPl gene was deposited with 
the Korean Collection for Type Culture of Korea 

15 Research Institute of Bioscience and Biotechnology 
(KRIBB) under the accession No. KCTC 0824BP on July 11, 
2000. 

<l-5> Isolation of HpCWPl gene of Hansenula polymorpha 
20 The open reading frame of the Saccharomyces 

cerevisiae CWPl gene was labeled according to the 
Klenow fragment method by a DIG labeling system 
(Boehringer Mannheim) and used as a probe for Southern 
blot analysis for screening the CWPl homologue from 
25 Hansenula polymorpha. Following hybridization at 42''C 
in a 30% formamide solution, southern blot signal was 
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detected. 

As shown in Fig. 5, several weak signals were 
detected. Among the various signals, a 61cb Sail 
fragment showing the most clear signal was cloned into 
5 pBluescript II SK( + ) at a Sail site to construct a 
recombinant vector, named HpCWPl. DMA sequence 

analysis disclosed that the cloned gene in the 
recombinant vector has a 24 6 bp sequences, which is 
expected to be an open reading frame represented by 

10 the SEQ. ID. NO: 7 with very low homology with the 
Cwplp of S. cerevisiae. Additionally, the amino acid 
sequence, represented by the SEQ. ID. NO: 15, was 
expected to have a secretion signal consisting of 15 
amino acids and a GPI-anchor signal predicted by the 

15 POSRT II prediction program. The corresponding protein 
was named HpCwplp. It was expected to be a cell wall 
protein by its amino acids structure. A novel E. coli 
trans formant harboring a recombinant vector containing 
the 6 kb Sail fragment coding for the HpCwplp of 

20 Hansenula polymorpha was deposited with the Korean 
Collection for Type Culture of Korea Research 
Institute of Bioscience and Biotechnology {FCRIBB) 
under the accession No. KCTC 0826BP on July 11, 2000. 

25 <l-6> Isolation of HpWSCl gene of Hansenula polymorpha 
Based on the fact that fluorescence was observed 
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on the cell surface when a green fluorescence protein 
(GFP) was expressed as a fusion partner with Wsclp, it 
was deduced that the amino terminal domain of Wsclp 
could be used as a surface-expression mediator. 
5 In the process of isolating the LEU2 gene of 

Hansenula polymorphs , the HpWSCl gene was obtained by 
Agaphonov et al. {Agaphonov et al., i^east, 1994, 10, 
509, GenBank U00889) . DNA sequencing analysis 
disclosed that the HpWSCl gene of Hansenula polymorpha 

10 consists of the 1,110 bp sequence represented by the 
SEQ. ID. NO: 8 encoding a 373 amino acid sequence. The 
protein encoded by the HpWSCl gene isolated from 
Hansenula polymorpha is a cell surface protein 
different in anchoring mechanism from GPI-anchor 

15 proteins and can be used to develop surface expression 
systems by its amino terminal domain. 

Example; 2 ; surface Express ion System Using Cell Wall 

Proteins Isolated from Hans enula polymorphs 

20 

<2-l> Construction of surface expression system 

Surface expression systems were constructed in 
which the four GPI-anchor protein genes and the HpWSCl 
gene, isolated from Hansenula polymorpha in Example 1, 
25 were used as mediator genes for surface expression of 
foreign proteins. Together with each mediator gene, a 
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CMCase gene derived from Bacillus subtilis (Park et 
al., Agric. Biol. Chem. , 1991, 55, 441) was used in 
the surface expression systems, functioning as a 
reporter gene . 

5 In this regard, a GAPDH promoter (Sohn et al., 

Appl. Microbiol. Biotechnol., 1999, 51, 800) and a 
killer toxin signal sequence (Sor, F. and Fukuhara, 
Curr. Genet, r 1985, 9, 147) were inserted into an 
AMIpLl vector (Agaphonov et, al., yeast, 1999, 15, 541). 
Then, the CMCase gene was fused in frame to give a 
CMCase surface expression vector, named pGK CMCF, 
which could express CMCase under the control of the 
GAPDH promoter, as shown in Fig. 6. The CMCase gene 
stretch corresponding to an amino acid sequence from 
glycine at position 31 to aspartic acid at position 
355 was obtained by PCR and linked downstream of the 
killer toxin signal sequence in the expression vector. 
As seen in Fig. 6, mediator genes were inserted into 
the carboxyl region of the CMCase gene between the 
recognition sites of restriction enzymes BamHI/Hindlll . 
Also, the expression vector was designed to have an 
ARS, HARS36 which directs the integration of vector to 
the telomeric regions of the chromosome of a host cell, 
so that comparison could be made of the translocation 
efficiency of surface anchor proteins (Sohn et al., J. 
Bacterial., 1996, 178, 4420) . 
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Four putative GPI-anchor protein genes, HpCWPl, 
HpGASl, HpTIPl and HpSEDl gene fragments, were 
amplified from the recombinant vectors by PCR using 
appropriate primer sets designed to provide BamHI and 
5 Hindlll sites for the PCR products. The amplified 
putative GPI-anchor protein genes were inserted to the 
expression vector pGK CMCF at the BamHI/Hindlll site 
located in the carboxy terminal region of the CMCase 
gene to construct CMCase surface expression vectors 

10 for HpCWPl^ HpGASl, HpTIPl and HpSEDl, named CwpF, 
GasF, TipF and SedF, respectively. 

Using primer sets containing BamHI/Hindlll sites, 
four nucleic acid segments, each encoding the sequence 
of the 4 0 carboxy-terminal amino acids of each protein, 

15 were amplified by PCR, and inserted into the 
expression vector to construct CMCase surface 
expression vectors, named Cwp40, Gas40, Tip40 and 
Sed40, respectively. 

For comparison of the surface anchoring 

20 efficiency, a DNA fragment, which encodes a carboxy- 
terminal sequence of 92 amino acids of Cwp2p, known as 
a GPI-anchor protein from S. cerevisiae excellent in 
anchoring efficiency, was synthesized by PCR and 
inserted into the same vector to construct a CMCase 

25 surface expression vector for Cwp2p, named ScCwp2, 
which was used as a control . 
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CMCase surface expression vectors containing the 
mediator protein genes were transformed into Hansenula 
polymorpha DLl-L (Hill et al., Nucl. Acids Res., 1991, 
19, 5791) and the transformants were selected in 
5 minimal synthetic media (2% glucose, 0.67% amino acid- 
free yeast nitrogen base, various amino acids with 
appropriate concentrations) devoid of leucine. 

<2-2> Measurement of surface anchored CMCase activity 

10 Each of the selected clones was inoculated in a 

YPD medium (2% glucose, 2% peptone, 1% yeast extract) 
and cultured at 37''C for 18 hours, after which the 
CMCase activity of the supernatant and the whole cell 
fraction were measured according to the DNS 

15 (dinitrosalicylic acid) method (G. L. Miller, Anal, 
Biochem., 1959, 31, 426). 

To test the CMCase activity, the culture broth 
was divided into a supernatant and a whole cell 
fraction by centrifugation. The whole cell fraction, 

20 after being washed twice with a citrate buffer (10 mM 
citrate buffer, pH 5.0), was suspended in the same 
buffer. The resulting suspension was added in a 1% CMC 
(carboxy methyl cellulose) solution and incubated at 
55''C for 30 min. Reaction was stopped by boiling for 5 

25 min in a DNS solution, followed by the measurement of 
absorbance at 550 nm. The results are given in Table 1, 
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below. The enzyme activity of CMCase was defined by U, 
and one U is the amount of enzyme activity which 
releases 1 \t mole of glucose from CMC for 1 min. 



5 TABLE 1 

Surface Expression Activity of CMCase According to 
anchoring Mediator 



MCMC: 



Anchoring Activity Activity in 

..... in Supernatant Whole Cell Fractio 
(U/ml) (U/ml) 



Sed4 0 


0. 


,40 


0. 


,059 


SedF 


0. 


,41 


0. 


,030 


Tip4 0 


0. 


,92 


0. 


,090 


TipF 


0. 


,45 


0. 


.024 


Gas40 


0. 


.19 


0, 


.010 


GasF 


0. 


.61 


0, 


.080 


Cwp40 


1. 


.10 


0, 


.150 


CwpF 


0. 


,65 


0, 


.110 


MCMC 


0. 


.14 


0, 


.003 


CMC* 


1 . 


.92 


0. 


.000 


ScCwp2 


0, 


.15 


0, 


.020 


DLl 


0, 


.00 


0, 


.000 



Transformant 



harboring a CMCase expression 
vector with no secretion signal sequence. 
CMC*: Transformant harboring a CMCase expression vector 
with a secretion signal sequence, a CMCase gene, and a 
stop codon instead of anchoring mediators 
DLl: wild type cell with no expression vector. 



In supernatants fraction, high CMCase activity 
was observed. This was believed to be attributed to 
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the fact that the CMCase expressed by the action of 
the GAPDH promoter was too much to properly anchored 
on the cell surface. Also, CMC, used as a substrate 
for CMCase activity assay, has the high molecular 
5 weight, so the enzyme anchored in the cell could not 
be effectively accessed to CMC. In whole cell 

fractions, the strains transformed with the vectors 
containing the mediator genes exhibit significantly 
improved CMCase activity, compared to those 

10 transformed with control vectors, MCMC devoid of the 
secretion signal sequence, and CMC* which had no 
mediator genes, thus demonstrating that CMCase was 
exported to the cell surface by each mediator. In 
addition, in transformant strains containing Tip40, 

15 GasF, Cwp4 0p and CwpF, CMCase were found to 
translocate to the cell surface at higher efficiency 
than those containing other mediators, as depicted in 
Fig. 7. 

CMCase activity was also monitored with regard to 
20 culture time periods, and the results are graphed in 
Fig. 8. CMCase activity tended to decrease in most of 
the cells when they reached the stationary phase. But 
the CMCase activity of the cells, which harbored the 
surface expression vector having the HpCWPl gene, 
25 although being lowered in the stationary phase, was 
found to remain high throughout the culturing period 
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in the whole cell fraction. Therefore, HpCwplp is 
inferred to be the most stable mediator among the 
proteins tested, as depicted in Fig. 8. 

From the data obtained, HpCwplp was identified to 
5 be superior in surface expression efficiency and 
stability among the novel four GPI-anchor proteins 
isolated from Hansenula polymorpha . 

EXAMPLE 3: Surface Expressi on System tJsina HpPirPp 

10 <3-l> Construction of surface expression system 

A surface expression vector was constructed in 
which the HpPIR2 gene encoding a cell wall expression 
protein different from GPI-anchor proteins was used as 
a mediator for surface expression of foreign proteins 

15 with CMCase as a reporter. To determine which part of 
the protein HpPir2p is useful for expressing target 
proteins on the cell wall, CMCase was fused to HpPir2p 
at its either termini, as shown in Fig. 9. 

In order to utilize the carboxy terminus of 

20 HpPir2p for fusion with a target protein, a gene 
fragment encoding a polypeptide stretch ranging from 
the Kex2 cleavage site to the stop codon of the 
protein HpPir2p was synthesized by PCR and inserted to 
the expression vector, pGK CMCF constructed in Example 

25 2, at the restriction enzyme sites (BamHI/Hindlll) to 
give a recombinant expression vector, named Pir-C. In 
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the case of the amino terminus of HpPir2p, the 
expression vector, pGK CMCF was deprived of the killer 
toxin signal sequence and replaced with a gene 
fragment encoding the full length of the open reading 
5 frame from the initiator methionine to the stop codon 
of the HpPir2p, so as to fuse HpPIR2 gene directly to 
the amino terminus of CMCase. The resulting surface 
expression vector was named Pir-N. The expression 
vectors were transformed into Hansenula polymorpha, 
10 followed by selecting transfoirmants in the same manner 
as in above. 

<3-2> Activity measurement of surface anchored CMCase 

The CMCase activity of the transformants selected 
15 in Example 3-1 was measured, after being cultured in 
YPD media, and the results are given in Table 2, below. 

TABLE 2 

CMCase Activity of Hansenula polymorpha Using HpPir2p 
20 as Surface Expression Mediator 



Mediator 


Activity 
in Supernatant 

(U/ml) 


Activity in 
Whole Cell Fraction 
(U/ml) 


Pir-C #1 


3.33 


0.010 


Pir-C #2 ■ 


2.80 


0.009 


Pir-N #1 


1.83 


0.080 


Pir-N #2 


1.86 


0.084 


CMC* 


2.53 


0.000 
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As shown in Table 2, almost the same CMCase 
activity was measured when the carboxy-terminal region 
of the protein HpPir2p was used as mediator (Pir-C), 
5 as when a CMCase gene with a stop codon was used, 
indicating that the carboxy-terminal region of the 
protein HpPir2p was completely deficient in surface 
anchoring ability. Also, the CMCase activity was not 
affected by the insertion of the protein HpPir2P. By 

10 contrast, where the amino-terminal region of the 
protein HpPir2p was used as a mediator (Pir-N) , CMCase 
activity was detected not only in the supernatant, but 
also in the whole cell fraction, demonstrating that 
the CMCase was expressed on the cell wall. 

15 Therefore, the protein HpPir2p was anchored in 

the cell wall and thus could be used as a surface 
expression mediator. Over conventional GPI-anchor 
proteins, the protein HpPir2p has the advantage of 
linking the amino terminus of a target protein to its 

20 carboxy terminus. This anchoring protein is useful to 
avoid the problem of the activity loss took place when 
the target protein has an active site in its carboxy- 
• terminal region. 



25 EXAMPLE 4: Surface? Expr Pssion System rising HpWsrlp 
<4-l> Construction of surface expression system 
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The protein HpWsclp is composed of a 
transmembrane domain by which the protein is anchored 
in the cell membrane, a serine/threonine-rich domain 
that traverses the cell wall, and an amino-terminal 
5 cystein motif which exists in the extracellular space, 
functioning to detect external signals (Verna et al., 
Proc. Natl. Sci. USA, 1997, 13804), as illustrated in 
Fig. 10a. 

In this example, the HpWSCl gene was used for 

10 constructing a surface expression system. In this 
regard, CMCase was fused to the amino terminus of 
HpWsclp and used as a reporter protein exposed on the 
cell surface. 

To identify which domain is responsible for cell 

15 wall anchoring, truncated HpWSCl gene fragments were 
synthesized by PCR using primer sets designed to have 
BamHI/Hindlll sites. Four truncated HpWSCl gene 
fragments, devoid of either the cystein motif (wsc-t) 
or the transmembrane domain (Wsc-c) , and of both of 

20 them (Wsc-o) , and an intact WSCl gene (Wsc-ct) 
encoding the full length of HpWSCl, were inserted at 
the BamHI/Hindlll site of the plasmid vector {see Fig. 
10b) . The recombinant vectors were transformed into 
Hansenula polymorpha DLl-L, followed by selecting 

25 transformants in the same manner as in above. 
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<4-2> Activity measurement of surface anchored CMCase 

After 18 hours culture in YPD broth, the CMCase 
activity of the selected transforroants was measured 
according to the DNS method, and the results are given 
5 in Table 3, below. 



TABLE 3 

CMCase Activity of Hansenula polymorpha Strains Using 
HpWsclp as Expression Mediator 



Expression Activity Activity in Whole 
Mediator Supernatant Cell Fraction 

(U/ml) (U/ml) 

Wsc-c 0.90 0.002 
Wsc-ct 0.35 0.069 
Wsc-o 2.40 0.003 
Wsc-t 0.04 0.006 
CMC* 2.54 0.000 
DLl 0.00 0.000 



As seen in Table 3, only when the transmembrane 
domain was presented in the fusion protein (Wsc-ct, 

15 Wsc-t), the CMCase was not secreted, but rather 
remained attached to the cell . However, although the 
transmembrane domain was present, CMCase activity was 
not detected in the whole cell fraction if the cystein 
motif was absent (Wsc-o, Wsc-t) . The result was 

20 believed to be attributed to the fact that the CMCase 
was not sufficiently exposed on the cell surface. In 
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contrast, when the transmembrane domain was absent 
(Wsc-o, Wsc-c) , the CMCase activity was mostly 
detected in the supernatant. In addition, the cystein 
motif was found to make an additional contribution to 
5 the secretion of the CMCase. 

Therefore, the mediator derived from the HpWsclp 
must have both, the cystein motif and the 
transmembrane domain for anchoring efficiency. As the 
activity detected in the whole cell fraction was 
10 almost the same as in the case of GPI-anchor proteins, 
the HpWsclp was suggested as a novel mediator for 
expression proteins on the cell surface. 

EXAMPLE 5: Surface Expression of r;iucose OxidasP 
15 <5-l> Construction of surface expression system 

Among the mediator proteins tested, three GPI- 
anchor proteins, HpTiplp (Tip40), HpGaslp (GasF) and 
HpCwplp (CwpF) , and the HpWsclp (Wsc-ct) were tested 
for the surface expression of glucose oxidase. For the 
20 surface expression of glucose oxidase, a glucoamylase 
signal sequence and a glucose oxidase gene were 
synthesized by PGR and fused to each other before 
being cloned into pBluescript II SK( + ) . pGK CMCF 
vectors in which the expression mediator genes were 
25 included were treated with EcoRI and BamHI to remove 
the killer toxin signal sequence and the CMCase gene. 
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followed by the insertion of the glucose oxidase 
expression cassette to give a surface expression 
vector pGA GOD. This vector was transformed into 
Hansenula polymorpha DLl-L and transf ormants were 
5 selected in the same manner as in the above CMCase 
case. 

<5-2> Activity assay of surface anchored glucose 
oxidase 

10 The transformants selected in Example 5-1 were 

transferred onto YPD plate media and measured for 
glucose oxidase activity according to the plate 
activity assay method using peroxidase and O- 
dianisidine (Hodgkins et al., yeast, 1993, 9, 625). As 

15 shown in Fig. 11, a smaller glucose oxidase activity 
circle was formed when expressing glucose oxidase with 
the anchoring mediators (Tip4 0, GasF, CwpF and Wsc-ct) 
than glucose oxidase without them (GOD*) . 

After being cultured in YPD media, the 

20 transformants, which were identified to have glucose 
oxidase activity on the cell surface by the plate 
activity assay method, were subjected to FACS analysis 
along with the wild type. The FACS results are given 
in Fig. 12. In this regard, the cultured cells were 

25 washed twice with phosphate buffered saline (PBS, pH 
6.5) and suspended in PBS containing 1% BSA (bovine 
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serum albumin) . The suspension was added with a 
glucose oxidase antibody (Accurate Chemicals) and 
allowed to stand for 1 hour on ice, after which the 
cells were washed three times with the same buffer and 
5 resuspended. A secondary antibody (FITC-labeled anti- 
rabbit antibody, Sigma) was added to the suspension 
which was incubated on ice for 30 min, followed by 
washing the cells three times with the same buffer. 

In all cases of using the anchoring mediators, 

10 fluorescent cells were observed by FACS (fluorescence 
activated cell sorter) analysis, indicating that the 
glucose oxidase was expressed and anchored onto the 
cell surface. Additionally, higher glucose oxidase 
activity was detected in the whole cell fraction than 

15 in the supernatant throughout all of the strains . On 
the whole, the activity of surface anchored glucose 
oxidase was found to agree with that of CMCase. When 
CwpF was employed, fluorescence was detected from a 
large number of cells, demonstrating that CwpF had an 

20 excellent cell wall anchoring activity. It was also 
found that Tip40 was efficient in expression proteins 
in cell surfaces, as analyzed by FACS (Fig. 12) . Under 
the mediation of GasF and Wsc-tc, fluorescence, as 
evidence of the cell wall anchoring, was detected, but 

25 in a low level, which resulted, to our knowledge, from 
the fact that GasF and Wsc-tc both are cytoplasmic 
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membrane proteins so that their exposure on the cell 
surface is poor compared to the other mediators. 
However, even in this case, there seems to be no 
problem in detecting the activity of the enzyme if the 
5 substrate penetrates the cell wall . 

The assay results obtained from two enzymes 
anchored in the cell wall by use of various mediators, 
taken together, demonstrates that HpCwplp is the most 
effective • surface anchoring mediator among the GPI- 
10 anchor proteins and the proteins HpPir2p and HpWsclp 
can be suggested as novel surface anchoring mediators. 

E XAMPLE Surface E?<:pression System Using 

Glucoamylase Gene 

15 <6-l> Construction of surface expression system 

A known cell wall protein was investigated as to 
its usefulness as an expression mediator. The 
glucoamylase gene STAl, derived from Saccharomyces 
diastaticus, was obtained and studied for its 

20 usefulness to construct a surface expression system in 
Saccharomyces cerevisiae. 

To begin with, various truncated STAl gene 
fragments were synthesized by PGR using appropriate 
primers and inserted to pBluescript II SK(+) at EcoRI 

25 and Smal sites. The cloning was confirmed by DNA 
sequencing. The recombinant vectors were digested with 
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EcoRI and Hindlll to create the site to which the 
GAPDH promoter of Saccharomyces cerevlsiae could be 
inserted. After completion of the insertion/ the 
digestion of the recombinant vectors with Kpnl and 
5 Ehel made various STAl genes fused with the GAPDH 
promoter, which were inserted into the expression 
vector YEp352 (Hill et al.. Yeast, 1986, 2, 163) at 
KpnI/PstI sites to construct CMCase surface expression 
vectors , 

10 With reference to Fig. 13, there are shown vector 

diagrams for the surface expression systems using the 
STAl gene. As depicted, the surface expression systems 
were constructed with vectors comprising the signal 
sequence only (STASS) , both of the signal sequence and 

15 the octapeptide sequence (STASS-8) , all of the signal 
sequence, the octapeptide sequence and the 
threonine/serine-rich domain (STASS-8-TS) , both of the 
signal sequence and the threonine/serine-rich domain 
(STASS-TS), both of the signal sequence and alpha- 

20 agglutinin (STASS-CMC-Agalc) , and all of the signal 
sequence, the octapeptide, and alpha-agglutinin 
{ STASS-8 -CMC-Agalc ) , along with a CMCase gene as a 
reporter gene. The CMCase gene was synthesized by PCR, 
cut with Smal/PstI and inserted into the expression 

25 vectors. The resulting expression vectors were 

introduced into Saccharomyces cerevlsiae L3262, which 
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were then cultured in a minimal plate medium deficient 
in uracil to select the transformant , which harbored 
the expression vectors. 

5 <6-2> Activity measurement of surface anchored CMCase 

The transformants selected in Example 6-1 were 
assayed for CMCase activity after being cultured at 
SO'C for 48 hours in YPD media. The assay for CMCase 
activity was conducted by the pNPC (p-nitrophenyl p-D- 

10 cellobiocide) method (Deschpande et al.. Anal. 
Biochem. , 1984, 138, 481). For this, the culture broth 
of each transformant was fractioned into a supernatant 
and a whole cell fraction by centrif ugation . The cell 
mass was washed three times with PBS and suspended in 

15 PBS. The suspension was mixed with a 2.5 mM pNPC 
solution and incubated at 37°C for 1 hour, followed by 
adding an equal volume of 2% sodium carbonate {Na2C03) 
to stop the reaction. Absorbance was measured at 410 
nm, and the results are given in Table 4, below. In 

20 Table 4, the enzyme activity of CMCase was defined by 
U, one U being defined as the amount enzyme activity 
to releases 1 n mole of pNP for 1 min. 

TABLE 4 

25 CMCase Activity in Saccharomyces cerevisiae 
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ss-CMCssS-CMCssts-CMCssSts-CMC 


Activity 
in Supernatant (U/1) 


339 


389 


189 


145 


Activity 
in Whole Cell Fraction (U/1) 


13.6 


14.5 


112 


128 


Activity in Whole Cell Fraction 
Activity in Supernatant (%) 


3.9 


3.6 


37 


47 



As shown in Table 4, when employing the signal 
sequence only (SS-CMC) , most of the CMCase was 
secreted extracellularly, not anchored in the cell 
5 wall. In contrast, when employing the 

threonine/serine-rich domain linked directly to the 
signal sequence (SS-TS-CMC) or when employing the 
threonine/serine-rich domain linked to the signal 
sequence via the octapeptide sequence (SS-8-TS-CMC) , 

10 the enzyme activity was measured to be low in the 
supernatant, but high in the whole cell fraction. 

The same results were confirmed by plate activity 
assay, as shown in Fig. 14. Large circles were formed 
around the colonies, which harbored the signal 

15 sequence only (SS) , as most of the expressed enzyme 
was extracellularly secreted. In contrast, the 

presence of the threonine/serine-rich domain hindered 
the enzyme secretion as proven by the small circles 
formed around the colonies . 

20 To confirm the transport of CMCase to the cell 

surface, FACS analysis for CMCase was conducted in the 
same manner as in Hansenula polymorpha. The results 
are given in Fig. 15. As observed by the FACS diagrams, 
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fluorescence was detected in the cells harboring the 
threonine/serine-rich domain, suggesting the presence 
of CMCase on the cell surface. Meanwhile, after CMCase 
was fused to the carboxy terminus of a-agglutinin, 
5 CMCase activity was measured to test the surface 
anchoring ability. However, CMCase activity in this 
case was not higher than in case using only the 
threonine/serine-rich domain. Therefore, the 

threonine/serine-rich domain of the STAl gene alone 
10 was identified to function as a good surface anchoring 
mediator . 

Thus, when used for the construction of surface 
expression systems, glucoamylase or its domains could 
be fused to the amino terminus of a target protein. 

15 Thus, these systems may circumvent the problems 
arising when a carboxy-terminal active domain of the 
target protein is interrupted by the mediator fused to 
the carboxy terminus. Therefore, the mediators 
described herein allow the development of surface 

20 expression systems capable of expressing various types 
of target proteins, according to the characteristics 
of the target proteins. 



25 



INDUSTRIAL APPLICARTT.TTY 

As described hereinbefore, surface expression 
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proteins derived from Hansenula polymorpha, an 
industrially useful methylotropic yeast, are highly 
effective in construction of surface expression 
systems and development of biocatalyst application 

•5 systems . Because of their ability to produce 

industrially useful biomaterials, such as enzymes, 
antigens, antibodies, etc., and to offer industrial 
production tools such as immobilized biocatalysts, the 
surface expression systems established in the present 

10 invention can find numerous applications in various 
industries, including the medical industry, the food 
industry and the chemical and biochemical industry. 

The present invention has been described in an 
15 illustrative manner, and it is to be understood that 
the terminology used is intended to be in the nature 
of description rather than of limitation. Many 
modifications and variations of the present invention 
are possible in light of the above techniques. 
20 Therefore, it is to be understood that within the 
scope of the appended claims, the invention may be 
practiced otherwise than as specifically described. 
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milJAIKST TllliATV ON IIIK INTKIINATIOKAI. lUsUKWITION Ol' TllK lll-IHiSII 
OK MIUlOylllJANISMS KOK Tllli l-UWUSli UP IWIEKT I'HCXfiWJllIi 

INTERNATIONAL FORM 

RECEIPT IN TIIE CASE OF AN ORIGINAL DEPOSIT 

issued pursuant to Rule 7.1 

TO : aO, Ed-Sung 

Dasol Apt. 102-507. #395-3. Kung-dong. Yusong-ku. Taejon 305-3:i'>. 
Uepublic of Korea/ 



1 . IDENTIFICATION 01- THE MICUOORGANISM 


Identification reference given by the 

nEPOsn-o«: 

ICscherichia coli 
I>IlS@/IIpTll>l 


Accession number given by the 
INTERNATIONAL DETOSITAIIY 
AUTHORITY: 

KCTC 082411 1» 


n . SCIl-NTll-lC DESCRIITION AND/OR PROPOSED TAXONOMIC DI-SKJNATION 


Tlic microorganism identified under I above was accompanied by: 

I X ] a scientific description 

( 1 a proposed taxonojnic designation 

(Mark with u cross where applicable) 


111. IIECI-IPT AND ACCEPTANCE 




This International Depositary Authority accepts tiie micnK)rBanism identified under I alxjve. 
which was received by it on July 11 2000. 


IV. RECKIII' OF REQUEST FOR CONVERSION 


'l"he microorganism ideniified under 1 above was 
Authority on and a request c 
under the i3udapest 'i'reaty was received by it o 


received by this Internal ional Depositary 
0 convert the original deposit to u deixisii 


V. INTERNATIONAL DEPOSITARY AUTHORITY 


Name: Korean Collection tor Type Cultures 

Address: Korea Research Institute of 
Bioscience and Biotechnology 
(KRIBB) 

fl52, Oun-dong, Yusong-ku, 
Taejon 305-3:i3. 
Republic of Korea 


Signature(s) of person(s) liaving the ijower 
to represent the International Depositary 
Authority of authorized officialls): 

BAE, Kyung Sook, Director 

Date: July 15 2000 ! 

i 
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wo 02/12509 



PCT/KROO/00819 



IN'l-EKNATIONAL K)HM 

RECEIPT IN THE CASE OF AN ORIGINAL DEPOSIT 

issutxt pursuiim to Rule V.] 

ID : CHOJ, Eui-Sung 

Dasol Apt. 102-S()7. #3a'i-3, Kuna-dnnK, Yusona-ku. Taejon :««-3:«>. 

Republic of Korea/ 



1 . IDENTIFICATION OF THE MICROORGANISM 



Idcntificalion reference given by the 
Dlil'OSlTOH: 



Escherichia coli 
miSO/IIpSED] 



Accession number givi:n by the 
INTERNATIONAL DI-ll'OSI I'AKY 
AUTHOIUTY: 

KCTC 0825BP 



n. SCIENTIFIC DESCRIPTION AN1)/0R PllOroSED TAXONOMIC DESIGNATION 



The microorganism identified under 1 above was accompanied by: 

[ X ] a scientific description 

[ 1 a in-oixjscd uxonomic designation 

(Mark with a cross where applicable) 



111. UECF.in' AND ACCEPTANCE 



This IniLTiiutional Depositary Authority accepts the microorganism identifiwl under 1 abovi;, 
which was received by it on July 11 2000. 



IV. HECI-II'T OF REQUEST FOR CONVERSION 



The microorganism identified under I above was received by this Internatinnul Deixjsitary 
Authurity on and a request to convert the original deixisil Ui a dcixtsii 

under the Budapest Treaty was received by it on 



V. INTERNATIONAL DETOSITARY AUTHORITY 



Name: Korean Collection for Type Cultures 



Address: Korea Research Institute of 
Bioscience and Biotechnology 
(KUIBB) 

#52, Oun-dong. Yusong-ku. 
Taejon 305-333. 
Republic of Korea 



Signatures) of person(s) having the |xiwi:r 
to represent the International Deposiutrv 
Authority of authorized officiaKs)- 

BAE, Kyung Sook. director 
Date: July 15 2000 



70 



wo 02/12509 
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INTERNATIONAL FOIIM 

RECEIPT IN TIIE CASE OF AN ORIGINAL DEPOSIT 

issuwl pursuant co Rule 7.1 



TO ■■ CHOI, Eui-Sung 



Dasol Apt. 102-507, #395-3, Kuntj-dong, Yusong-ku, Taejon 30!')-3.35, 
Republic of Korea/ 


I . IDKN ril-ICATION OF THE M1CR(X3RGANIS^4 


Idt'iiiilkation reference given by the 
DFJ'OSITOH: 


Accession number given by ihc 
INTERNATIONAL DKlt^l I-ARY 
AUTHORITY: 


Jlischerichia coli 
DlI5@/llpCWPl 


KCTC 0826BP 



D. SCJEN-J-ll-lC DESCilirnON AND/OR FlKM'OSEl) 'l AXONOMIC DKSKiNA'iKJN 



'l*hc micruurganism identified under 1 above Was accompanied by: 

[ X 1 a sciencinc description 

[ ] a pruix>sed taxonomic designation 

(Murk with a cross where applicable) 



ni, Hiicivirr and acci-i'tance 



'i'his IntcrnationaJ Depositary Authority accepts the microorganism ideniiricd under I atxjvu. 
which was received by it on July 11 2000. 



IV. IIECEU'T OF REQUEST FOR CONVERSION 



The microorganism identified under I above was received by this International Dei^ositary 
Authority on and a request to convert the original dei»«it tt> u deposit 

under the iiudapesi Treaty was received by it on 



V. INTERNATIONAL DEPOSITARY AUTHORITY 



Name: Korean Collection for Type Cultures 



Address: Korea Research Institute of 
Bioscience and Biotechnology 
(KRIBB) 

HSZ. Oun-dong, Yusong-ku, 
Taejon 305-333. 
Republic of Korea 



Signaiure(s) of personls) having the ijowei 
to represent the International Depositary 
Authority of authorized official(s): 



1^ 



BAE. Kyung Book, Director 
Date: July 15 2000 
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A. CLASSIFICATION OF SUBJECT MATTER 
IPC7 C12N 15/52 

According to International Patent Classification (IPC) or to both national classification and IPC 

B. FIELDS SEARCHED 

Minimun documentation searched (classiflcation system followed by classification symbols) 
1PC7 C12N 15/52 

Documentation searched other than minimun documentation to the extent that such documents are included in the fileds searched 



Electronic data base consulted during the intertnational search (name of data base and, where practicable, search trerms used) 
USPTO, NCBI pubmed 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Citation of document, with indication, where appropriate, of the relevant 



Relevant to claim No. 



US 6, 114, 1 47 Sep 5, 2000 (05. 09. 00.) 

US 5,223,408 June 29, 1993(29.06.93) 
US 5,223.409 June 29, 1993(29.06.93) 

US 5,602,034 Feb. 1 1. 1997 (1 1. 02. 97) 

Mol Cell Biol 2000 Mar ;20(9): 3245-55 

Appl Environ Microbiol 1998 Dec;64{12):4857-61 



1-10,11-14 
1-10,11-14 

1-10,11-14 
1-10,11-14 
1-10,11-14 



□ 



Further documents are listed in the continuation of Box C. 



I I See patent family annex. 



Special caiegorles of cited documeins: 

document defining the general state of the art which is not considered 
(o be of parlicular relevence 

earlier application or patent but published on or after the international 
filing date 

document which may throw doubts on priority clalm(s) or which is 
cited to establish the publication date of citation or other 
special reason (as specified) 

document referring to on oral disclosure, use, exhibition or other 



"T" later document published after the international filins date or priority 

date and not in conflict with the application but cited to understand 

the principle or theory undeitying the invention 
"X" document of particular relevence; the claimed invention cannot be 

considered novel or cannot be considered to involve an inventive 

step when the document is taken alone 
"Y" document of particular relevence; the claimed invention cannot be 

considered to involve on inventive step when the document is 

combined with one or more other such documents.such combimition 

being obvious to a person skilled in the ait 
"&" document member of the same patent fomily 
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